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— INTRODUCTION — 
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1. Why this Guide? 


The European Commission has been promoting the use — and sharing — of open-source software within its Framework 
Programmes on Research and Technological Development — 7‘ FP and presently Horizon 2020 — as research 
applications developed with EU co-financing must, in many cases, be made available as FOSS. Of particular interest to 
translation, are the projects in the field of Machine Translation and CAT tools like Moses (Euromatrix(Plus), MosesCore), 
MateCat and Casmacat. 


The European Commission Open Source Strategy also stresses that “the Commission services will increasingly 
participate in open source software communities to build on the open source building blocks which are used in the 
Commission's software”. It is therefore in this context that open-source applications like OmegaT are being used in DGT. 


OmegaT (OT) is a free open-source CAT Tool that was developed, by private initiative, originally by Keith Godfrey in 
2000 and that has been vastly improved since then with many contributions (See OmegaT— Help — About for a list of 
contributors). Didier Briel is its present project manager. OmegaT is now the leader open-source CAT tool. 


OmegaT was used by DGT in 2012 for prototyping prior to the acquisition of a commercial CAT tool at the end of that 
year. For that purpose, the 2.6.0._3 version of OmegaT was customized and extended to integrate other DGT tools. 


This version of Omegat is internally referred to as DGT-OmegarT to differentiate it from the public version. Furthermore, 
DGT developed in-house the OmegaT Project Wizard to integrate Omega’ in its workflow. 


In June 2012, DGT-OmegatT and its Project Wizard were made available to all DGT translators interested in trying/using 
it and it has been used by some translators ever since as an alternative to the main DGT CAT tool. 
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Because DGT-OmegaT has some variations compared to the public version, | wrote a guide to take into consideration 
OmegatT’s in-house new/improved/adapted features and its Project Wizard and also DGT’s workflow and work methods. 


In the meantime DGT has updated its OmegaT version with the 3.1.2 public version and developed in-house the 
TeamBase application which allows the sharing of project memories among translators who can be using either of the 
CAT tools available in DGT. 


So this Guide is now updated to include all the changes and improvements (DGT-OmegaT-3.1.2_3+DGT Extensions 2.3. 
beta update 5). 


To reflect its widened scope, it has also been renamed DGT-Omegaf, its Project Wizard and DGT’s CAT Environment 
—A Translator’s Guide to stress that a CAT tool is just a part of an integrated working environment. 


As the Guide has grown quite a lot — both in scope and size — mainly due to a substantial number of OT new features, 
| also wrote a Quick Guide that can be accessed via the DGT-OmegaT Project Wizard or here. 


oe 
HN | had also posted 3 videos for the 2012 version of DGT-OmegaT (50 minutes on the whole: Video 1, Video 2 and 
Video 3 — DGT-OMEGAT and its Project Wizard in a nutshell) as it is so much easier to show than to explain! 


A substantial part of the videos continues to apply to the new version of DGT-Omegav. 


2. How to install DGT-OmegaT 


If you wish to try/use DGT-Omegar, you can install it from here. Just select this application and click on Install to have 
the latest release of the DGT-customized OmegaT — that includes the Project Wizard and TeamBase — installed in your 
service PC/laptop in a few minutes. 


Furthermore, if you are not a teleworker, but you would like to be able to work at home in your private computer — as | 
sometimes do — you can simply copy/paste DGT-Omegar? to it (or ask for help to do it), taking into consideration, of 
course, that you won't have access to DGT databases. 


As itis a free open-source CAT tool, there are no extra licence costs involved. 


3. HelpDesk support and training 


You can request HelpDesk support from the IT Unit as for any other DGT application. 


You can also request training. 


4. Purpose of this Guide 


The main purpose of this Guide is to introduce DGT translators to DGT-OmegaT and its Wizard in the perspective of 
DGT workflow and work methods. 


This Guide aims to give detailed information about DGT-OmegaT’s features, highlighting those | consider more important 
for translators in our working environment and — hopefully — to help you to easily translate your (single or 
multi-document) projects taking full advantage of its powerful features. 


But this Guide is also meant to be useful for DGT newly recruited translators or trainees who are not familiar with DGT 
workflow and/or any CAT tool. Therefore, it also gives an overview of DGT’s CAT Environment and suggestions on how 
to make the best use of the resources available. 


As this Guide is meant for “absolute beginners” — both in terms of CAT tools and DGT CAT environment — | hope DGT 
old-timers and CAT tool advanced users will forgive me for explaining what may be (for some/many) so obvious... 
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5. Approach in this Guide 


| present here my personal view after using DGT-OmegaT for 3 years in all my translations. However, take into 
consideration that | translate mainly Word, and sometimes Excel and PowerPoint, documents and therefore | have no 
experience with formats like the ones used, for instance, for web pages. 


In this Guide, | assume that you have read the Quick Guide and therefore have a general idea of how to work 
with DGT-Omegat, its Wizard and TeamBase. The Quick Guide may even be enough if you prefer to keep to the 
essentials. 


However, for large and complex projects, it may be worthwhile to take the time to get more detailed information, at least 
about the features that are (more) important to you. 


Considering that the bulk of my work is translating large multi-document projects, | give a lot of attention to the 
management of complex projects in this Guide. 


As in DGT we have the privilege of having an IT Unit that takes care of all the technical details — installation, plugins, 
scripts, defaults, compatibility, etc. — this Guide focus merely on the translation process and reflects the way | use 
OmegaT to translate documents in Office format — which are the huge majority of documents translated in DGT — and 
of course it is not meant to cover every feature and possible use of Omegat. 


For this Guide to make sense on its own, it is necessary to repeat basic information already given in the Quick Guide, 
which is further developed in the relevant detailed sections. 


Considering that the preparation and management of projects is very important to make your work easier, | reorganised 
the update of this Guide following a different approach, by starting with the basics and gradually giving more detailed 
information in thematic sections so that you can choose the level of detail you are interested in according to your needs. 


The main parts are: 

y Anoverview of DGT’s CAT Environment 

y Anoverview of DGT-OmegaT main features and how to work with OmegaT and its Wizard 
y Detailed information concerning the DGT-OT project structure 
> 


Information about the preparation, creation, update and general management of projects, working in shared 
mode with other translators in real time (TeamBase) and revising with DGT-OT 


y A section on “Menus Explained” so that you can have a quick snapshot of all the features available in 
DGT-OT menus, with shortcuts and icons 


y Thematic sections on how to explore OmegaT’s features: translation memories and machine translation, 
search features (concordance), terminology, notes, auto-completion, formatting/tags, spellchecker, language 
checker and quality assurance, revision process with variants and attributes customization 


y Assection on troubleshooting — although OT is a fairly stable and trouble-free application — and a list of 
shortcuts 


y AnAnnex on Machine Translation — taken from the Moses for Mere Moses Tutorial — which gives a general 
view of what Statistical Machine Translation is about, so that you have an idea why Machine Translation 
output is sometimes so surprisingly good ... and other times pure rubbish! 


As with OmegaT you can manage very large projects with many documents, many memories and large glossaries, it 
may be really worthwhile for you to explore its full potential, so much so that there are no speed or capacity problems in 
handling them with Omegat in our service computers. 
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6. Symbols used in this Guide 
| use some symbols in this Guide to call your attention to certain aspects. 


MPP To mark features which are fully DGT-specific or have only some adaptations/improvement as compared to the 
public version. It is also used to mark applications (linguistic or documental) which are developed by EU 
institutions. 


be) To indicate that it is a feature that is new in DGT-OmegaT 2014 (either DGT-specific or not) as compared to 
DGT-OmegaT 2012, be it a completely new feature or an improvement on an existing feature. 


“® To indicate that there is further information in another section of the present Guide or on the Internet. 


& To indicate that it is a tip or personal comment. 


a To indicate that you should pay attention to a particular detail. 


Troubleshooting: Simple ways to solve problems ... hopefully! 


7. More information 


DGT-OT Guides (available via the DGT-OT Wizard): 
y The DGT-OmegaT-2014 and its Project Wizard — Quick Guide 


yY DGT-OmegaT 2014, its Project Wizard and DGT’s CAT Environment — A Translator’s Guide (the present 
Guide), which updates the 2012 version. 


yY DGT-Omegat and its Project Wizard — A Translator’s Guide (2012). For translators who were already using the 
2012 version and might want to look at it. 


For more information on the public OmegaT features or on features not (sufficiently) covered in this Guide: 
y Public OmegaT Guides (available via the DGT-OT Wizard and in the public OmegaT website): 
q OmegaT 3.0 — User's Guide by Vito Smolej (the Guide in the OmegaT Help in pdf format for easier 
consultation) 
q Omegal for CAT Beginners by Susan Welsh & Marc Prior 
y Public sources: 
q_ The public OQmegaT website and its very active User Group 


q The OmegaT documentation page also gives you further information on available guides, training videos, 
blogs, etc. 


aad Take into consideration that DGT-OmegaT has substantial adaptations! 
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9, New/improved/adapted features and applications 


In this Section you will find a list of the new features of the present version of the public OmegaT (3.1.2.) and also of 
those that have been added, improved or adapted to integrate it in DGT environment, both in relation to the 2012 
implementation of OmegaT in DGT. 


The DGT-specific new/improved/adapted features are identified with the DGT logo in this list and throughout this Guide. 


oat If you look for information in the public OmegaT Guides, take into consideration that some features are different 
in DGT-Omegat. 


In this Section you also find a list of the applications directly linked to OmegaT which are DGT-specific and are new or 
improved. 


=a DGT applications Short description 


GF DGT-Omega¥ Project Integrates OmegaT in DGT workflow by making the link between 
Wizard Tradesk, Windows Explorer and DGT-Omegal. It allows to easily 
create and update multi-document projects with translation 

memories and IATE extractions. Vastly improved from the 2012 


version. 
‘GF = TeamBase New server application to share project memories in real time in 
= read or read/ write mode. 

GF TagWipe A tag cleaning script developed in-house for Word documents to 
eliminate useless inline tags. It has been improved from the 2012 
version. 

Project folder — Short description 
Preferences 

ee 4=_CONFIG-PERSONAL This folder is automatically created — under the OmegaT_Projects 

i folder folder — when DGT-Omegat is installed. 
iF Omegat.prefs and Your preferences are stored in the CONFIG-PERSONAL folder to be 

_ uiLayout easily accessible to you. 
bie learned_words and These personal “dictionary” files are also stored in the 
-__ ignored_ words _CONFIG-PERSONAL folder to be easily accessible to you for 
editing. 
GF = search-memorize List of memorized terms or regular expressions. This file is stored 
in the CONFIG-PERSONAL folder to be easily accessible to you for 
editing. 
P| toolbar short description 
GF New icon — IATE Besides the 20 DGT icons — three of which for DGT applications — 


there is now a direct link to IATE full-fledged interface. 


ia 
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|| titorpane | Short description 


@F «Improved colour Yellow background colour for below 100% match (now a different 


markings for 100% and 
below 100% matches 
and MT 


Auto-populated 
segments from external 
TM Xs colour-coded (in 
orange) 


Display of a second 
source language in the 
Editor (tmx2source) 


Mark segments for 
revision (in red) 


Go To Next/ Previous 
Revised Segment 


The order of documents 
can be changed 


Segment open for 
translation 


Order of matches display 


Repeated segment 
Found in 


Match display template 
— Configure format 


shade) and green background colour for 100% matches (also a 
different shade). The MT is marked with a grey background as 
before and the TRA/source/ Match information remains the same. 
A new feature is that the changes made by the translator (while 
the segment is not validated) are no longer highlighted, thereby 
calling the attention to the parts that were not changed. 


Auto-populated (pre-translated) segments are displayed in the 
Editor with an orange background if the options Save 
Auto-populated status in the Editing Behaviour menu and Mark 
auto-populated segments in the View menu are activated. 


Used to help translating from a relay language or to easily view 
terminology in a different target language for reference. 


Segments with a login different from the user’s login are displayed 
with a red background if — in the View menu — this option is 
ticked. This option automatically adds View diff in target in the 
Options — External TMXs menu so that track-changes are 
displayed in the target segments in the Fuzzy Matches pane. Used 
for the revision stage. 


Allows to quickly go to segments with an ID other than the login of 
the user. Used for the revision stage. Shortcuts: Ctrl+Shift+X/ Y. 


The order in which the documents for translation are displayed in 
the Editor can be changed in Project menu — Project Files. 


When reopening a project, OT opens the last segment that had 
been edited in the previous session in that project, if any. 


= Fuzzy Matches Short description 


GF Track-changes 
MR Track-changes in target 


Improved display (before it was compare differences) 


Option to select track changes in source (default) and also in target 
or in both in the Options — External TM Xs: View diff in source and 
View diff in target 


There are now 3 options: Full text including tags and numbers 
(default); Stemming, no tags and no numbers; No tags and no 
numbers. 


In the Fuzzy Matches pane you can also have information about 
the number of occurrences of a particular segment in the external 
translation memories — and also in the project memory — ina 
dropdown menu which lists the memories where that segment 
occurs. 


Attributes to be displayed with the segment in the Fuzzy Matches 
pane — which already existed in the OT public version and in 
DGT-OT 2012 — have been improved with new template variables. 
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ra View project documents Short description 


Pal cag View the source 


document with shortcut 


View the current target 
document with shortcut 


Already available in the DGT-OT 2012 but without shortcut 
(Ctrl+Shift-+H) 


New feature. It automatically creates the current translated 
document and opens it in_ its native application 
(Shortcut: Ctrl+Shift+G) 


|| searchfeatures | Short description 


Redesigned window 


Translated and/or 
untranslated fields 


Boolean NOT 


Glossary 


Notes 


Translator field 


Search by file or folder 
field 


Memorize 


List of memorized 
Regular Expressions 


Match display template 
— Configure format 


The DGT-OT 2012 Search window had been redesigned with new 
features (strings, whole words and lemmas), Search In Source and 
In translation and the possibility of customizing the segment 
attributes (as for the Fuzzy Matches). The Search Directory feature 
had also been separated from it. 


Now in DGT-OT 2014 the Search feature has been further improved 
(see features below) 


It is possible to set the search to translated or untranslated 
segments or both. 


Besides searching in In Source and In translation with the Boolean 
AND or OR, it is now possible to use the Boolean NOT in the fields: 
In source, In translation, In notes, Author and Translator 


It is now possible to Search in the glossaries of the project 


It is now possible to Search in the notes, if any, of the project 
memory and of the external translation memories. 


A new field Translator which allows differentiating between the 
Author of a tmx (which may be the translator or the assistant who 
post-aligned that document) and the translator. 


It is now possible to limit the search to one memory (tmx) file or to 
a folder with several tmx files, by writing/copying its path and 
name to this field. 


It is now possible to memorize searches — for the session, the 
project or all projects — in the fields: In source, In translation, In 
notes, in Search Scope, Author and Translator and for Regular 
Expressions and File or folder. 


When DGT-0T is installed, there is already a list of commonly used 
Regular Expressions memorized for all projects for the In source 
field. 


Attributes to be displayed with the segment in the Search window 
— which already existed in DGT-OT 2012 — have been improved 
with new template variables. 


DGT-OMEGAT, ITS WIZARD WIZARD AND DGT’S CAT ENVIRONMENT — A TRANSLATOR’S GUIDE — MJM — June 2015 


= Search and Pre-Translate Short description 


Ge From Source 


RCT From Match 


From Machine 
Translation 


Directory 


He Individual project 
memories for Euramis — 


Create Euramis Export 


Individual project 
memories for revision 
purposes — Create 
OmegaT Export 


Open Project Folder 
Open Glossary 

QA — Check Rules 
Show Same Segments 
Strip tags 

Write Notes to File 


Spellcheck 


Write Query Notes to File 


This is a brand new DGT-specific feature. 


Allows searching by some criteria and pre-translating the resulting 
segments by copying source to target. 


Allows searching by some criteria and pre-translating the resulting 
segments with external memories matches. The match threshold for 
pre-translation can be defined. 


Allows searching by some criteria and pre-translating the resulting 
segments with machine translation output. 


[44 Search Directory Short description 


M@F Redesigned Search 


In the DGT 2012 version, it was already a separate window. Now it 
has been improved. 


P| scripting Short description 


Brand new OT feature which allows using scripts developed by the 
open-source community around OmegaT. Some have been included in 
DGT-OT or can be selected. See below. 


A DGT adaptation of the publicly available script write_sel_files2TMX 
allows exporting memories by document (without notes and 
alternative translations) with the DGT attributes required to be sent to 
Euramis. Generated by pressing Ctrlt+Shift+F8 and selecting the 
documents to have memories exported from. 


A DGT adaptation of the publicly available script write_sel_files2TMX 
allows exporting memories by document (with notes and alternative 
translations). Used in DGT for the revision process. Generated by 
pressing Ctri+Shift+F9 and selecting the documents to have memories 
exported from. 


Allows opening the project folder from within Omegat. 
Shortcut: Ctrl +Shift+F1. 


Allows opening the writable glossary for editing in Notepad++. 
Shortcut: Ctri+Shift+F2. 


Allows carrying out a series of quality checks for the whole project 
or for the current document. Shortcut: Ctrl+Shift+F3. 


Displays a list of segments with source identical to target. 
Shortcut: Ctri+Shift+F4. 


Removes tags from matches in the target segment in the Editor. 
Shortcut: Ctri+Shift+F5. 


Allows extracting notes to an html file for processing. 
Shortcut: Ctrl+Shift +76 


Allows spellchecking in batch in the whole project or in the current 
document. Shortcut: Ctrl+Shift+F7. 


Allows extracting selected notes to an html file for processing. 
Shortcut: Ctri+Shift+F10. 
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P| Glossaries Short description 


Target term from the The target term of the writable glossary entries is displayed in bold 
project writable glossary in the Glossary pane. 
in bold 
iF = Improved display of The source terms are displayed in blue, the target terms in green 
glossary entries and the 3™ field in black characters. The third field is now 
displayed together with the respective target term. 
fF Entries in alphabetical Entries after the writable glossaries entries are displayed in 
order alphabetical order. 
GF Filtered IATE glossary When creating a project with the DGT-OT Wizard, a filtered 
extraction from IATE is automatically copied to the project 
\glossary folder. 


= Auto-completion Short description 


Brand new public OT feature. 


From Glossary It is possible to use the project glossaries for auto-completion. 
From Auto-text It is possible to create and use abbreviations for 
auto-completion. 
From Character table It is possible to insert character/symbols from a (customized) list. 
From tags Tags can also be inserted selecting them from a dropdown menu 
Pas | shortcescription 
Insert next missing tag New feature which allows to easily insert (first or) next tags one 
by one. Shortcut: Ctlr+T 
Validate tags for current Now tag validation can be also done for only the current 
document document. 
Tag validation improved The listing of tag mismatches has been improved. 
P| statistics | Shortdescription 
Match Statistics per File OT now also provides match statistics for each document with 


the indication of repetitions in documents and between 
documents, thereby giving an immediate snapshot of the work 
involved in each of the documents 


GF OT_Stats excel sheet Excel sheet where you can manually copy/ record your progress 
in the translation of your project. Can be accessed via the 
DGT-OT Wizard (Stats) or from OT. 
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— PART A— 
OVERVIEW 
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A.1. DGT CAT Environment 


Translating in the 21s‘ century is — as everything in life now — an activity that integrates a whole series of (IT) resources 
and applications. Therefore, we cannot look at a CAT Tool isolated from the environment in which it is — in a certain way 
— the “maestro” interlinking with other applications, frequently in real-time. 


So to efficiently translate and manage projects using OmegaT in DGT context, it is worthwhile to know its working 
environment as when you translate you are — or can be — in fact using a range of language applications and 
documental resources that you can optimise in different ways. 


Knowing what is available — and their advantages and shortcomings — can contribute a lot to the quality of life of a 
translator! 


So, let’s first have a brief look at DGT’s CAT Environment and the “building blocks” that are used in your daily work and 
which are, to different degrees, integrated with or linked to DGT-OmegatT and/or its Wizard. 


® General information on DGT’s CAT Environment: Translation tools and workflow 


® Detailed Index at the end of this Guide with clickable links. 


A.1.1. SDL Trados Studio 2014 


=] Studio is the main CAT tool in DGT since 2013 when it replaced Trados Translator’s Workbench. It is a standalone 
application that can be used with TeamBase and other applications to allow translators working on a project to share 
memories in real time. 


A.1.2. CAT Client 


MM the CAT Client is the Wizard developed in-house to integrate Studio in DGT’s workflow. 


A.1.3. DGT-OmegaT 


t 
2, OmegaT — the main subject of this Guide — is an open-source CAT tool which has been adapted by DGT for 


prototyping purposes. It is a standalone application that can be used with TeamBase to allow translators working on a 
project to share memories in real time. 


It has been made available to translators in June 2012. It is used by some translators as an alternative CAT tool. It is 
internally called DGT-OmegaT as it has some new/improved/adapted features compared to the public version. 


In the present implementation of OmegaT in DGT, (single or multi-document) projects are created locally, but OT also 
allows working on a server or partially redirecting, for instance, project glossaries or external memories subfolders to a 
server while keeping the main project locally in each translator's computer. 


A.1.4. DGT-OmegaT Project Wizard a 
of 


The DGT-OmegaT Project Wizard (DGT-OT Wizard) — also an important subject in this Guide — is an in-house 
application which integrates OmegaT in DGT’s workflow by making the link between Tradesk, Windows Explorer and 
DGT-Omega¥, thereby making it very easy and fast to perform most of the project management key operations. 


[eo 
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A.1.5. TeamBase B® 


TeamBase — also an important subject in this Guide — is an in-house server application that allows translators working 
as a team sharing memories in real time with colleagues who are working on the same or related projects and are either 
using DGT-Omegart or the main CAT tool. 


A.1.6. Trados Translator’s Workbench 


The old TWB is still used to translate Recast documents as its Commission-specific format is not accepted by the new 
CAT tools used in DGT. It is also used to translate RUE (confidential) documents. 


A.1./7. Tradesk i 


TEANSLATOR 


“esse? Tradesk is DGT’s document management system developed in-house which stores the original documents 
to translate, reference material to be used and the ongoing and released translated documents. 


It is the starting point to create or update OT projects with one or several documents and respective translation 
memories and an IATE glossary. 


A.1.8. Euramis #8 


Euramis is DGT’s Translation Memories repository which stores all the translations done in DGT for more than 
15 years, as well as in some other EU institutions, and from which retrievals and reference documents are extracted in 
tmx format and to which project memories of translated documents are sent also in tmx format. 


At present, the Euramis central memory contains more than 645 million segments covering all EU official languages. 


Euramis is not publicly available but DGT has published two large corpora in tmx format: DGT Translation Memory and 
DGT-Acquis 


® Joint Research Centre website: DGT-Acquis and DGT Translation Memory 


A.1.9. MT@EC a 


|/MT@EC Unlike the public OmegaT, DGT-OmegaT uses machine translation files generated by the in-house MT@EC 
service — based on the Moses open-source Statistical Machine Translation system — which are copied to a separate 
project subfolder (\mt). 


®MT@EC: DGT Magazine Languages and Translation 


@ Moses: Moses and MosesCore research project and TAUS — Machine Translation and Moses Tutorial. 


A.1.10. IATE a 


IATE (Inter-Active Terminology for Europe) is the EU inter-institutional terminology database. It has been used in 
the EU institutions and agencies since the summer of 2004 for the collection, dissemination and shared management of 
EU-specific terminology. It replaced the old Eurodicautom database — which had been used since 1973 — and 
incorporated all its data. 


IATE contains over 8.4 million terms, including approximately 540 000 abbreviations and 130 000 phrases and covers all 
the 24 EU official languages, as well as Latin. 


® validated entries are made available in the public IATE website and an extraction of it is also made available here. 
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A.1.11. Windows 7 and Office 2010 


The operating system currently used in DGT is Windows 7 and the public Omega? relies mostly on Windows Explorer 
for the management of projects. 


For operations which are not automated via the DGT-OmegaT Project Wizard, Windows Explorer allows to 
copy/paste/move, drag/drop, rename and delete files in a very easy and flexible way. 


In fact, when a project is reopened in Omega’, it has no “recollection” of how the project was before and just accepts it 
as it is. So any changes made via Windows Explorer are accepted. 


A.1.12. Quest || aa 


= Quest II is a DGT metasearch tool designed to reduce the time it takes translators to find solutions to terminology 


problems by enabling to search, in a single operation, a multitude of DGT’s internal and public terminology sources. 


® See profile in Quest interface. 


A.1.13. DocFinder a 


od DocFinder is a DGT application which allows to quickly locate and display an EU official document based on 
references, notably from EU Treaties, legislation and EU bodies’ documents. 


DocFinder is DGT’s implementation of WebExtractor, an application developed in collaboration between Giancarlo 
Piovanelli from the European Economic and Social Committee and Jodo Rosas from DGT. 


A.1.14. Eur-Lex a 


@ Eur-Lex is the online repository of published EU legislation which can be accessed by anyone free of charge here. 


It contains the treaties, secondary legislation and preparatory acts in all the EU official languages, as well as national 
implementing measures and case-law of the Court of Justice of the European Union. It is also possible to consult the 
Official Journal of the European Union. 


A.1.15. Vista — SGVista 


Des Vista is the Commission's Secretariat-General database that gives access to non-classified Commission 
documents and to related procedural information. Vista is the successor of SGVista. 


SGVista is still available because it contains Council and EP documents as well as interinstitutional files. Vista and 
SGVista are internal systems for Commission staff. 


A.1.16. DGTVista 


DGTVista is a DGT document search and viewing engine. It contains all incoming (mainly original texts) and 
outgoing documents (mainly translations) from and to every Commission department since 1994. 
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A.1.17. Euramis Alignment Editor (Pedit) a 


Pedit is an in-house Translation Memories Management application which allows the editing of (automatic) 
alignments. 


A.1.18. Voice recognition 


Dragon Dictate or DictaTrans are available and can be used with OmegaT. 


A.1.19. XBench 


ya The freeware version of XBench provides simple and powerful Quality Assurance and Terminology Management 
and can be used — as a complement to OmegaT QA features — as it is installed in DGT computers. 
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A.2. DGT translation work 


As the Commission has the power of legislative initiative within the EU, the translation work in DGT is a bit different 
compared to the other EU institutions. 


Translation work in DGT may be the translation of a one-page press release or a project with a few documents, some 
dozens of pages and involving only 1 translator and 1 reviser. In this case the workflow is very simple. 


However, things can get (much) more complicated with large projects — frequently in (very) technical domains — with 
hundreds (or even thousands) of pages translated over a period of months and frequently with (a high number of) new 
versions while the documents are being translated into (up) to 23 EU official languages. 


These projects usually involve several translators and even several revisers, not to mention that sometimes part of the 
documents in a project may be translated in-house and another part by freelance translators. 


Furthermore, in DGT there is not the bilateral “client — translation provider” relationship in which terminology validation 
is decided once the translation has been accepted by the client. 


DGT is, most of the times, the starting point of legislative proposals. The authors in the Directorate-Generals produce the 
documents, but there are also the lawyer-linguists, the national experts and the Member States representations which 
may be involved, namely in what concerns terminology for sensitive or very technical matters. 


Besides that, legislative proposals have a life of their own and a substantial part may involve a co-decision procedure in 
which the Council and the European Parliament — and of course their respective Translation Services — are involved. 


Terminology — like the language — evolves all the time and, for our work, there is the further complexity of legislation 
and terminology in many domains that evolve in this interinstitutional context. 


So, things are all but straightforward and — as in DGT the translators are the managers of their projects — the flexibility, 
user-friendliness and speed of OmegaT in what concerns, namely, the creation and updating of projects, the prioritising 
of translation memories, the use of machine translation and of small or large glossaries (including IATE extractions), is of 
particular importance to DGT translators. 


A.2.1. Project approach 


Nowadays, CAT tools follow a multi-document project approach. You work with projects, which may have one or any 
number of documents in various formats — with hundreds or even thousands of pages on the whole — with their 
respective retrievals and reference document memories (tmx files) from Euramis, MT output, glossaries (an IATE 
extraction), dictionaries, monolingual reference documents and any other material — technical or administrative — you 
may need to translate/manage your project. 


In DGT-OT, the documents are treated in a single project in a speedy way, even when there are successive new versions 
during the translation of the project. Each document is individually identified so that you always know where you are. 


A.2.2. Documents in a project 


The original documents can be: 


v_ One, some or all documents of large packages (for example, European Semester, Budget, Rail Package, 
Multiannual Financial Framework) with different dossier and part numbers (Example: RTD-2014-80020-00-00 + 
RTD-2014-80020-00-01 + RTD-2014-80021-00-00 + RTD-2014-80023-00-00). 


Documents of dossiers with numerous parts (like a model contract with several parts), 
SRC files (typically Excel or PowerPoint files with tables, graphs) to be integrated in Word documents, 
New versions of documents (quite frequent), 


Several small documents which you may want to gather in a project (like a number of replies to parliamentary 
questions, cartouches) even if unrelated, just to save time, 


v_ Any combination of these. 


<<c<cdc 
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If your documents have a subfolder structure, OT will keep it. You can also easily change the order in which OT displays 
the documents of your project in the Editor for translation. 


In the present implementation of DGT-Omega’, the source Office documents are not converted to xliff format. Therefore 
you have as source files the original documents and OT saves your translation to the project memory. 


A.2.3. Document formats 


The documents to translate may be in different formats (Word, Excel, PowerPoint, x(html), etc.), but they will all be 
displayed the same way as “raw” text in the Editor pane. This is now a standard approach in open-source and 
commercial applications and therefore you no longer work with formatted documents. 


A.2.4. Commission special formats a 


The Commission has some special formats. With Budget documents, there is no problem, you can use Omegalv. 
However, as the budget documents are pre-treated and partially translated from previous budgets, those documents will 
have to be post-aligned. 


With Recast documents, it is not possible to use OmegaT or DGT main CAT tool and you will have to use the old TWB. 


A.2.5. Confidential documents 


Projects can be easily created locally, directly in OmegaT, in the case of documents that cannot be transmitted via the 
network (SECEM documents). 


A.2.6. Merging documents 


All the documents in your project are, by default, automatically merged and treated as a project unit. The 
auto-propagation of the translation of identical source segments (non-unique segments) is done automatically in all the 
documents of your project — in a background operation you don’t notice — and those segments are displayed greyed. 


A.2.7. New versions of documents already in translation 


Due to the EU requirement that legislative proposals in all language must be publish simultaneously, in DGT there is a 
high percentage of new versions when the translation into (up) to 23 EU official languages is already ongoing. 


In OmegaT, you can update your projects with new versions very easily without the need to create a new project. 


For projects that may have, for example, dozens of documents — and sometimes several new versions for some or all of 
them — and hundreds of pages, the possibility of simply adding the new version and deleting the previous one in a few 
clicks and in a few seconds/minutes makes project management a fast and easy task. 


A.2.8. Translating multilingual source documents 


In DGT, sometimes there are documents in 2 (or more) languages. You can translate these multilingual documents in a 
single project without the need to divide the document by source languages. However, the project memory — which will, 
by definition, be multilingual — should not be sent to Euramis. 


A.2.9. Translating with the help of a relay language (tmx2source) ® 


Due the number of combinations between the 24 EU official languages, sometimes it is necessary to translate 
documents via a relay language or with the help of a relay language. 


With OmegaT you can easily use a relay language as a complement, having both the original language and the relay 
language displayed (one below the other) in the Editor as source segments. This feature may also be useful if you want 
to see how a particular document has been/is being translated into another language, for terminology consistency 


purposes, for instance. 
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A.3. DGT-Omegat, its Wizard and TeamBase in a 
nutshell ™° 


Omegat is a CAT tool which may not be high on looks nor in the myriad of options ... but which is very user-friendly and 
simple to use and — surprisingly — very sophisticated in its own way! 


In this Section is given general information on its more important features ... that you may want to explore further in the 
relevant detailed sections according to your work method and needs. 


®D Detailed Index at the end of this Guide with clickable links. 
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Screenshot 1 — OmegaT panes: Editor, Fuzzy Matches, Machine Translation, Glossary, Notes, Multiple 
Translations and also the TransTips (Translation Tips) feature. Dictionary and Comments panes minimized. 
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A.3.1. Management of DGT-OT projects a 


As in DGT the translators are the managers of their projects, it is important that the management be simple, fast and 
user-friendly. 


The public OmegaT has basic management features. This is not a shortcoming, it is just the approach taken: not to 
duplicate what can be easily done using a File Manager system like Windows Explorer. 


For DGT, this is definitely an advantage because it allows our IT Unit to easily integrate OmegaT in DGT workflow in a 
very flexible way, interlinking with Tradesk and Euramis and other applications. 


For that purpose, DGT developed in-house the DGT-OmegaT Wizard which helps you to quickly manage your projects. 


ocaLocal Documents - eo bachup\DG T\Dossies\CNEC 


om memory 


here mode in req 


=o a 


OT Wizard dialog bax 


Screenshot 2 — DGT-OmegaT Project Wizard window and features 


You can also easily work in shared mode (by interlinking with TeamBase), update projects with new original documents / 
new versions of documents already in the project and/or new memories, delete documents/memories from the project, 
add glossaries (e.g. an extraction of IATE), organise translation memories and archive finalized projects. 


The final part of the workflow — sending (ongoing or finished) translated documents to Tradesk and the document 
memories to Euramis — is in the pipeline. But it is simple and fast to do it manually. 


The DGT-OT Wizard also triggers automatic backups (every 10 minutes) of your active project to a server (your space in 
the H: drive) in a background operation you don’t see and without interrupting your work. 


| 
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A.3.2. DGT-Omegal project structure B&@ 


The DGT-OT Wizard will create the DGT-OmegaT project in a _ subfolder of the 
C:\Users\{your login}\AppData\Local\DGT\OmegaT_Projects default folder. 

@ Make ita favourite in your computer! 

DGT workflow is based on extractions (retrievals) from Euramis and MT output, both available as tmx files via Tradesk. 


The DGT-OT Wizard creates the OT project with the original document(s) — wiping useless tags in the process 
(TagWipe) — and the respective memory (tmx) files available for download in Tradesk: retrievals, aligned reference 
documents and machine translation, if any (pre-processed for EN— other languages), and also an IATE extraction. 


The DGT-OT Wizard creates the project making TagWipe and segmentation rules project-specific. Therefore, if any of 
them are changed/improved by the IT Unit during the translation of a lengthy project with (many) new versions, there will 
be no change in those rules for your particular project and you won't have unduly untranslated segments. 


In DGT, the DGT-OT Wizard is the access point to OmegaT (OT it is not accessed directly). 


THE DGT-OMEGAT PROJECT STRUCTURE 
when a new project is created via the DGT-OT Wizard 


= Monolingual or bilingual 
b. dictionary dictionary(es) , if any 


). euramis 
E é t export-omegat 


glossary create automatically a writable 


+ lossary for the project 
bicad aa mt g y proje 


omegat 


Glossary(ies) if any —If none, OT will 


Working project memory — 


one for the whole project source 


TagWipe 
target 
tm 
Properties file created with the ® omegat.project 
project. Don't touch i! 
# ) OTStats_T-ELARG-2014-80031-80034.xisx 
© T-ELARG-2014-80031-80034-levell.tmx 
Global project memories created with ® T-ELARG-2014-80031-80034-level2.tmx 
Ctri+{ Shift+) D— without notes 
® T-ELARG-2014-80031-80034-omegat.tmx 


Screenshot 3 — Typical structure of a DGT-OmegaT project also including folders/files created during the 
translation of the project 


You can update your project at any time during the translation process with new original documents and/or aligned 
reference memories and glossaries without the need to create a new project... unless you choose to do so. 


If you update your project with new documents or new versions of documents already in the project, when you reopen 
the project, the 100% segments (including formatting) that you had already translated — which are stored in your project 
memory — will be automatically displayed in the Editor pane as translated without any other action on your part. 


You can add as many subfolders as you want to the project folder — for example, to have monolingual reference 
documents which can be used for Search purposes or to gather any other information, either technical or administrative 
— but you must not delete or rename any of the subfolders or files that were automatically created for each 
project. 


™ If you do, OT may not work properly... or at all! 
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The Open Project Folder feature is a new OT feature that enables you to access the project folder from within Omega«, 
Opening it in Windows Explorer. It is an alternative to using the DGT-OT Wizard for some management operations. 


However, some continue to be best performed — or are really a must! — via the DGT-OmegaT Wizard, namely creating 
and updating projects, working in shared mode in real time (TeamBase) and having automatic backups every 10 
minutes. 


A.3.3. Sharing memories in real time — TeamBase ®&@ 


You can share memories in real time using the DGT TeamBase application. With it, you can control when and how you 
want to work in share mode — in read/write or only read mode — depending on your work method. 


“¢ Omega? Project Wizard (20141001) 


= 


<a TeamBase Memories 


v 
a | | Available shared memories _ Shared memories linked to my project _Shared memories | own 
/DEVCO-2012-90005 |DEVCO-2012-90005, 
JENTR-2014-00379 | 


~ 
2 Ila lok 


i} DEVCO-2013-2014-GRANTS-80C | | DEVCO-2012-90005 | Read/Write ] 


Connect To Shared ] Disconnect From Shared [ Delete shared memory 


Screenshot 4 — TeamBase window 


With the DGT-OT Wizard, you (or any another translator) can create a TeamBase memory for a project which is, in fact, 
just another translation memory fed with a copy of the segments validated by the translators connected to it in 
Read/Write mode. The results are displayed in the Fuzzy Matches pane and identified with the usual attributes 
preceded by “TeamBase” so that you know where the segment came from. 


Both 100% and partial matches are immediately available to all the translators linked to a particular TeamBase memory. 


You can share your translation — segment by segment in real time — from the very beginning of your translation just by 
connecting to the TeamBase memory of your project and, if you want, of other projects that may be ongoing (Read/Write 
mode). 


You can also decide to only share your segments when you consider that they are already sufficiently “good” to be of any 
use to others, but even so you can receive segments translated by others (Read mode). This feature is particularly 
useful if you first do a fast draft translation which you afterwards improve. 


TeamBase memories are bilingual — created for each language pair — and can be accessed by any translator who is 
translating a project with that language combination, be it in one Unit, in the whole Language Department or in other 
Language Departments... if they are using TeamBase too, of course. 


[| 
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A.3.4. Machine Translation (MT) &&@ 


In Omega’, there is a Machine Translation pane which displays MT output separately therefore never “mixing” human 
and MT translation in the Fuzzy Matches pane (where translations retrieved from Euramis are displayed) nor in the 
Search window. 


The MT files are automatically stored in the \mt subfolder when the project is created with the DGT-OT Wizard. 
In DGT, machine translation is produced by the MT@EC service which is run by DGT. 


A.3.5. Translation memories 


In Omega’, the project memory (project_save.tmx) — which contains all the segments you have translated — is stored 
in a separate project subfolder (lomegat) and the reference/retrieval translation memories are stored in another 
subfolder (\tm), thereby making it easy (and safe) to manage all the project files — and notably the aligned reference 
document files — you want to use (or no longer use) for a particular project. 


A.3.5.1. Pre-translate (auto-population) @ 


OT gives you complete control over the external reference memory(ies) you want to use for pre-translation. You just 
have to copy one or more tmx files to the \tmlauto subfolder of your project and the 100% (including tags) segments will 
“auto-populate” your documents. 


Anew OT feature is that those segments will, by default, be highlighted with an oral d to call your attention 
to the fact that they were automatically transferred to your project memory. They will remain so highlighted unless you 
modify those segments. 


You can use pre-translate before starting translating a project or at any time in the middle of the process. In the latter 
case, only untranslated segments in your project memory will be pre-translated. Segments you had already translated in 
your project will remain untouched. 


When you Update a project with a new version of an original, there is no need to pre-translate as your 
translated segments are all in the project memory and will be automatically inserted in the new versions (if they 
are a 100% match including tags) without any action on your part. 


You can also pre-translate — or copy source to target in batch — in filtered segments of your project using the new 
DGT-specific Search and Pre-translate feature. 


a When using Euramis memories for pre-translation, take into consideration that they may be of post-aligned 
documents and therefore they may contain misaligned segments! 


A.3.5.2. Reference Memories by subjects/subfolders, with priorities or penalties 


With OmegaT you can easily organise your reference external memories — i.e. memories other than your project 
memory —for a particular project in any way you want. 


You can have several "reference" memories and rank them — or give them penalties — according to your preference 
(either individually or by groups in subfolders) so that the first results displayed come from those you give priority to. 


In the Fuzzy Matches pane, matches will be displayed by match rate first and then by the order of priority, if any, you 
have given to your reference memories. You can organise them easily via the DGT-OmegaT Wizard — or directly in 
Windows Explorer — just by moving, copying or deleting files in the \tm project subfolder. 


& For complex documents/dossiers with many reference documents of different priority (for example, EU Treaties 


have precedence over Regulations and Directives, but these have precedence over Communications, Reports, 
etc), this can save a lot of time and assure consistency with previous main reference legislation! 
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A.3.5.3. Reference memories with a different target language displayed as Fuzzy 
Matches 


OmegaT allows the use of translation memories with the same source language but with a target language different from 
the one of your particular project. This may be useful, for instance, to check terminology in a different target language if 
there is already a previous translation of the same (or of a similar) document that can be used as reference. 


In order to have matches you only need to have the respective tmx files in the \tm subfolder. OT accepts any memory 
without “censorship”. 


& If you use this feature, you may give a penalty to these memories if you want! 


A.3.5.4. Translating with the help of a relay language (tmx2source) ® 


OmegaT also allows the use of a (reference) external translation memory with the source language identical to the 
source language of your project and the target language of the reference memory different from the target language of 
your project, displaying it in the Editor below the original source segment. 


Example: You have an LT document to translate into PT and your knowledge of LT is not perfect (or far from it)... but 
there is already a LT-EN translation of the same document. In this case you can create a LT-PT project and use the 
LT-EN external translation memory to have the EN relay language displayed in the Editor below the LT original segment. 
In this way you can see both the LT and EN as original segments. 


You can also use this feature just to display a translation into any another language for terminology purposes. 


A.3.5.5. Sharing external memories on a server 


If you are working on a project with several other translators and — besides using TeamBase to share segments in 
real-time — also want to share external memories, you (and your colleagues who are also working on that project) can 
do it by redirecting the location of the project \tm subfolder in the Properties menu. 


This way, if you have a coordinator for that project, reference memories can be organised “centrally” to be used by all 
translators/reviser(s). The project memory of each translator's project will remain in the \omegat folder stored locally in 
your/their computers. 


A.3.6 Terminology — glossaries B® 


In OmegaT, you can use as many glossaries as you want. If you have one or more glossaries you wish to use in your 
project, you can just copy it/them to the project \glossary subfolder to use as read-only. 


As OmegaT can use very large glossaries without a significant impact on its speed, you can even have glossaries of 
more than a million entries (with 3 fields: source, target and a comment). 


The display of entries in the Glossary pane has been improved in DGT-Omegav. 


A.3.6.1. IATE term extraction ®&@ 


When you create or update a project with the DGT-OT Wizard, if you accept the default for IATE, you will have a filtered 
IATE extraction (source term and target term) automatically added to your project to be used as a read-only glossary. 
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A.3.6.2. Writable glossary B® 


The first time you create a terminology entry in your project, OT will automatically create a glossary for that project (in txt 
format), which is the writable glossary. 


You can also define one — and only one — glossary that you already have as your writable glossary, i.e. the glossary in 
which your new entries will be stored. For that glossary to be read by OT as writable the simplest way is to (re)name 
“glossary”. 


A.3.6.3. Writable glossary accessed from within OmegaT @ 


The Open Glossary feature (Ctrl+Shift+F2) enables you to open — in Notepad++ — the writable glossary file from 
within OmegaT so that you can modify or delete entries, in a batch operation. 


A.3.6.4. Glossary entries in each open segment (TransTips) 


The terms/strings with a blue linear and bold underline (by default) mean that there is an entry in one of the glossaries 
(displayed in the Glossary pane) of your project. 


By right clicking on the mouse, the translation(s) of that term/string will be displayed in a dropdown menu. You can select 
to insert it at the position of the cursor in the target segment in the Editor by clicking on it. 


A.3.6.5. Sharing glossaries on a server 


If you are working on a project with several other translators and — besides using TeamBase to share segments in 
real-time — you also want to share glossaries, you (and your colleagues who are also working on that project) can do it 
by redirecting the location of the project \glossary subfolder in the Properties menu. You can share the non-writable 
glossaries and each translator can have its own writable glossary accessible to all the translators working on that project. 


This way, if you have a coordinator for that project, glossaries can be organised “centrally” to be used by all translators. 


A.3.6.6. Use of glossaries in XBench 


OmegaT glossaries can be used directly in XBench for QA purposes. 


A.3./. Dictionaries 


You can have as many dictionaries as you want/can find, either bilingual or monolingual. There are dictionaries freely 
available on the Internet that can be downloaded — both monolingual and bilingual (of uneven quality) — for most of the 
EU languages. There is a Dictionary pane where terms present in the dictionaries are displayed. 


By default, there are no dictionaries available when installing DGT-OmegaT. 


A.3.8. Statistics B® 


OT provides statistics for the documents within your project and match statistics for the whole project. 


In this new version, it also provides match statistics for each document with the indication of repetitions in documents 
and between documents, thereby giving you an immediate snapshot of the work involved in the translation of each of the 
documents. This feature is also very useful to distribute the documents when there are several translators involved in the 
translation of a project. 


OT also provides statistics of how much you have already translated and what remains to be translated. Besides, it 
displays and continuously updates the number of (unique) translated/non-translated segments, by document and for the 
whole project. 


When you create a project with the DGT-OT Wizard, you also have automatically copied to your project main folder an 
Excel sheet — OT_Stats S@ — in which you can manually copy/record your progress in the translation of your project. 
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A.3.9. Backups 


When creating a project, the DGT-OT Wizard will do an automatic backup of it to your personal space in the H: drive: 
H:\CAT\OmegaT_Projects. This copy is to be used in case there is a problem/crash in your computer. 


This backup is automatically updated every 10 minutes — in a background process without interrupting your work — for 
the active project defined in the Project field of the DGT-OT Wizard. 


™ So don’t close the DGT-OT Wizard after opening a project! 


Furthermore, OT itself does automatic backups of your project, by default every 3 minutes, which are saved locally in the 
lomegat subfolder of your project, so the risk of losing any of your work is minimal. 


If you are a teleworker, OT will not slow you down as it is not a resource-hungry application and backups are done in a 
fast background process without you noticing it. 


A.3.10. Sending translated documents to Tradesk 


When you have finished the translation of your documents — or while translation is still ongoing — you can send them to 
Tradesk using the Upload feature in Tradesk. The automation of this process is in the pipeline! 


A.3.11. Sending memories of translated documents to Euramis B® 


When you have finished translating your documents and have used OT until the very end (revision included, if any), you 
can send the individual document memories to Euramis with the correct DGT attributes, both in single document and 
multi-document projects. The automation of this process is in the pipeline! 


A.3.12. Copying projects to another location/computer 


As when a project is opened in Omega’, it has no “recollection” of how it was before, you can just copy/paste a project 
folder to another location of your computer or to another computer via Windows Explorer and open the project as usual 
in the other location. 
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A.4. How OmegaT works in a nutshell 


In the present implementation of OmegaT in DGT, original documents are not converted into xliff files. They are directly 
processed by Omega from the source formatted files present in the project source subfolder. 


To understand how OT works — and what to watch out for ! — here is an illustration of the process. 


WHAT HAPPENS WHEN YOU FIRST CREATE A DOCUMENT IN A PROJECT, TRANSLATE, 
GENERATE THE FORMATTED TRANSLATED DOCUMENT AND CLOSE THE DOCUMENT 


tst Create 
CONVERTED 


ORIGINAL ‘AND SEGMENTS Translated 
TRANSLATED 
DOCUMENT (ggmeua pepe 


Document 


remains TRANSLATED 
unchanged DOCUMENT 


WHAT HAPPENS WHEN YOU REOPEN THE DOCUMENT IN THE EDITOR TO CONTINUE 
TRANSLATING AND GENERATE AGAIN THE FORMATTED TRANSLATED DOCUMENT 


SEGMENTS IN 
THE PROJECT tst Create 2nd Create 


ORIGINAL CONVERTED MEMORY Translated Translated 
INSERTED IN ocumen’ Document 
DOCUMENT Senmyren THE EDITOR 


sepa FORW TIED FORMATTED 
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unchanged 


Figure 1 — Opening and reopening projects and creating and recreating translated documents 


After creating a project, every time you open it, your original document(s) will be segmented and processed and the 
segments that you may have already translated (saved to the project memory in the project lomegat subfolder) that 
have a 100% match (including tags) are displayed in the Editor pane. 


The 100% match segments in your project memory are never displayed in the Fuzzy Matches pane. In this pane are 
only displayed matches from the external translation memories in the \tm subfolder or below 100% matches from your 
project memory. 


As every time you reopen a project OT repeats the segmentation and processing operations, when you add or delete 
documents in your project, OT will “accept” the different set of documents and will display in the Editor pane the 100% 
translated segments stored in your project memory. 

In Omega’, this is not considered a pre-translation. Pre-translation is when you use external memories — that 


you copy to the \tmlauto subfolder — to automatically transfer to your project memory all the segments with a 
100% (including tags) match in that/those memory(ies). 


Those segments will be transferred (“auto-populated”) to your project memory and displayed in the Editor and 
(by default) highlighted in a background colour to call your attention to it when you open that project or do 
Reload. 
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The translated segments in the project memory which, for the new set of documents, have a match lower than 100% will 
be displayed in the Fuzzy Matches pane as “orphan” segments. For an equal match, they will be displayed first. 


When you do Create Translated Documents or Create Current Translated Document, OT will, respectively, convert 
all the documents or the active document to the format of the native application and store it/them in the project \target 
subfolder. 


When you repeat this operation — recreating the translated document(s) — all the translated documents previously in 
the \target subfolder will be deleted. 


™ So be careful not to make changes in the translated documents in this subfolder as they are deleted if you 
recreate the translated document(s). 


& A good idea is to save the documents you want to change in their native applications to a new folder that you can 
create in your project (for instance, Documents_for_revision) and/or upload the translated document to Tradesk. 


You can generate the formatted document(s) as many times as you want, taking into consideration that any changes you 
make directly (formatting or content) in the document native application will not be transferred to the OmegaT project 
memory. 


™ So, don’t make any changes in the translated documents in their native application unless you are sure you don't 
want to use OT again for the translation of that particular document/project. 


When you finish the translation of one or more documents in your project, you must use the Tradesk Upload feature to 
have the documents copied to Tradesk to be released. 


You can also send the individual document memory(ies) to Euramis if you have finished your translation — including 
revision, if any — using OmegaT. If not, the document(s) will have to be post-aligned. 
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A.5. DGT-OmegaT — Editing in a nutshell 


In this Section is given general information on the more important features just to give you a general idea about the OT 
editing features ... that you may want to explore further in the relevant detailed sections according to your work method 
and needs. 


® Detailed Index at the end of this Guide with clickable links. 


A.5.1. Merging of documents and auto-propagation 


All the documents in your project are automatically merged and treated as a unit by default. 


OT numbers the segments of the documents sequentially from the first segment of the first document to the last segment 
of the last document and displays the name of each individual document in the Editor ribbon so that you always know 
which is the document you are working on. 


When you start translating, OT will auto-propagate the translation of non-unique segments (repetitions), if any, 
automatically — in a background operation you don’t notice — for the whole project. 


lf you want to inactivate auto-propagation, in the Project — Properties — Edit Project menu, untick 
Auto-propagation of Translations. 


™ lf you do it when you are in the middle of the translation of a project, the segments that were already 
auto-propagated will remain so. Only non-unique segments that were not translated before will not be 
auto-propagated. 


If documents are added to or deleted from the project — by adding or deleting them to/from the \source folder — when 
the project is reopened or you do Reload, OT merges again the new set of documents. 


For a new version of a document previously in the project (and the previous version is deleted from the project) with 
alternative translations — i.e., the same source segment has 2 or more translations, which are known as “alternative 
translations” — those alternative translations will be correctly inserted in your new text if the new non-unique 
segments are preceded and followed by the same segments (equivalent to “perfect match” in OT). 


A.5.2. Number of open documents 


OT cannot have several documents open at the same time, but as it treats your project globally, its powerful Search 
feature will search in all the documents inside your project — no matter how many they are — and the Fuzzy Matches 
pane will display matches from all the documents in the project. 


You can also do Search/Replace and Search and Pre-Translate in all the documents of your project. 


A.5.3. Segment display B® 


Source and target segments are displayed vertically one on top of the other. This applies to the Editor, the Fuzzy 
Matches and Machine Translation panes. 


There are several options to display segments in the Editor: display only target or source and target segments; mark 
non-unique segments, segments with notes, translated segments, non-translated segments, revised segments, 
modification info, etc.. 


You can choose at different stages of your translation to have them displayed differently just by clicking or unclicking the 
relevant option in the View menu. 


&~ Personally, | prefer this display, by far, to the table (side-by-side) display as it is less tiring for the eyes... 


especially when translating for long hours! 
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A.5.4. Segment status 


In OT, there are 2 segment statuses: untranslated and translated. All translated segments are stored in the project 
memory and are available for Fuzzy Matches. 


If you leave the target segment empty, the original will be used when the document is created in its native application. 
So, if a particular segment is not to be translated, you don't have to copy the source text to the target segment ... but you 
can do it if you want. 


™ If you do not copy the source to the target segment, in the statistics, namely in the count displayed in the OT 
window below with the translated and untranslated segments, those segments will be counted as untranslated, of 
course. 


a If you delete the text in an already translated segment using the option “Set Empty Translation’ in the Edit 
Menu, in the translated document in the native application there will be an empty line at the place of that segment. 


A.5.5. Segment editing 


Besides the editing function in OT described in this Guide, you can also edit your segments using the general shortcuts 
Ctri+A, Ctrl+C, Ctrl+V, Ctrl+X, Ctrl+Z, Ctrl+Y. These only apply to the segment open for editing in the Editor pane. 


A.5.6. Segment identification a 


The segments you translate in a project are identified with your login, date and hour and saved in the project_save.tmx 
file in the project \omegat subfolder. 


When you change the translation of a segment, the previous translation is discarded, which means that after validating 
the segment you cannot go back to the previous translation. 


All the segments in the project memory have your login with the exception of segments that have been pre-translated 
(auto-populated). Those segments are identified with the login recorded in the translation memory used for 
pre-translation unless you open that segment and change it. If you don’t change it, the segment identification will remain 
unchanged too. 


When revision is made in OT, the segments changed by the reviser will have the login of the reviser, of course. 


A.5.7. DGT attributes for Fuzzy Matches and Search a 


Segments from translation memories are, by default, identified with the name of the \tm folder and subfolders (if any), 
the name of the file, the date, the name of the translator (if available) and the match rate. 


The translator can change the attributes both for the display in the Fuzzy Matches pane and in the Search window. 


A.5.8. Track changes in Fuzzy Matches B&@ 


The differences between the source segments and less than a 100% matches in the translation memories are now 
displayed in a more user-friendly way as track-changes in the Fuzzy Matches pane. 


The translator can also choose to see the track changes in the source or the target segments, or in both. The display of 
track changes in the target segment is used in the revision process. 
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A.5.9. Fuzzy Match percentages 


There are three match estimates available: 
y Match percentage (taking into account tokenizers) 


Y Default OmegaT match: number of matched words — with numerals and tags ignored — divided by the total 
word count 


yY OmegaT match, including numbers, tags 


Example: 100/91/95%. As you have also to guarantee that formatting will be correctly displayed in the documents in their 
native applications, always look at the lowest percentage. 


A.5.10. Preferences 


OT preferences are 1-level. You can easily change them any time you want. OT will “remember” your last preferences 
when you reopen it, either with the same project or with another project. 


The only exceptions relate to Filters and Segmentation, which can be defined at general or project level, but DGT 
translators don’t have to worry about it as the IT Unit takes care of these technical aspects. 


A.5.11. View the source document # 


You can open the original document you are translating — in its native application — directly in DGT-OT by selecting 
View Source file in the Project menu. 


A.5.12. View the translated document B&@ 


There is no (real-time) preview. However, you can open the (finished or incompletely) translated document you are 
working on — in its native application — directly in DGT-OT by selecting View Target file in the Project menu. You can 
edit it if you want. 


The segments you have not translated yet will be in the source language in the native application. 


A.5.13. Printing 


You cannot print the document as seen in the OT Editor. You can only print it from its native application. 


A.5.14. Revision process B&@ 


You (if you are the translator) — and your reviser — can do all the revision work using OT, although some manipulations 
will be necessary as the workflow is not automated. 


If OT is used for revision, you and the reviser can include notes in problematic segments and “communicate” through 
them and also add or change entries in the project writable glossary. 


The segments that have been changed by the reviser can be searched with a new DGT feature — Next/Previous 
revised segment — and are highlighted with a red background to call your attention to them. 


Furthermore those segments are displayed with track-changes in the target (instead of the source) segments — in the 
Fuzzy Matches pane — so that you can easily see the changes made by the reviser and accept them or not — if you 
(as the translator) have the last word, of course. 


However, in OT you cannot accept only part of the changes in a particular segment: it's either the whole revised segment 
or your whole previous translation ... and therefore it may involve some typing. 


& On the other hand — as the track-changes are displayed in the segment in the Fuzzy Matches pane and not in 
the Editor — you don't have to do anything if you accept a revision as the text is always "clean"! 
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A.5.15. Tag management @ 


Inline formatting is displayed in the form of tags (Example: <t0/>). 


You will only see what is called “inline tags” (i.e., tags inside the segment) and not all the tags related to styles or to the 
whole segment. 


By placing the cursor over each tag, you will see its description (in a very cryptic way!). It is mostly useful to see if itis a 
footnote ... which is not to be missed! 


If you want, you can also translate with Remove Tags activated, which means that you won't have any tags at all in the 
documents ... but afterwards you (or the Unit Secretariat) will have to add the formatting, if any, manually in the 
document generated in its native application. 


In this new version of Omega’, the management of tags has been substantially improved and new features have been 
added. 


You can easily validate tags in batch whenever you choose with the Tag Validation feature. 


_& Entries with modified tags - 
= ©. ff 6 


and regulating Switzerland's participation in the ITER € que rege a participagao da Suica nas atividades do projeto 

activities carried out by Fusion for Energy is hereby ITER realizadas pela Empresa Comum Fusion for Energy Missing 
approved on behalf of the European Union, subject to the  aprovada, em nome da Unido Europeia, sob reserva da = Fix 
conclusion of that Agreement<t0/> celebragdo do referide acordo. 


This Detision shall be published in the <t0/>Offiial Journal A presente devisdo é publicada no Jomal Oficial da Unido Missing — 
of the European Union. Europeta. > Fi 


The proposal/nitiative relates to <t0i>an action redirected A proposta/iniciativa refere-se a uma agdo reorientada para i Missing 
towards a new action uma nova ac¢ao = u 
Medidas politicas que visam promover a cooperacao entre a 

<{0l>Policy measure to encourage cooperation between UE e a Confederagao Suiga, tendo em conta a importancia da 
the EU and <tf!>Euratom and Switzerland in view of the |Nestgaga0 C&T para as Partes e a implementapao conunta 
importance of S&T research for the Parties, <t2/> <t3!>on- em curso de programas de investigaco de interesse miltuo, a 

aes fim de permitir a cooperagdo € 0 acesso a atividades Missing 

<td/> <t5/>i tation of h ; 

12 sms me es — t wc at give realizadas no ambito do Programa-Quadro Horizonte 2020, do = Fix 
atcnks to acthdias canted outin Horizon 2020. Euratom Programa Euratom ITER e do Desenvolvimento da Energia de 


[TER and the Development of Fusion for Energy (F4E). Fuso para a Produgdo de Energia F 


+ 


Screenshot 5 — Tags validation by document or for the whole project 


You can also have OT warning you, before opening the next segment for translation, if it finds that there are tag 
mismatches between source and target of the active segment. By default this option is not activated. 
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A.5.16. Search in DocFinder, Quest, Euramis and IATE &&@ 


You can access these DGT applications directly in DGT-OmegaT just by clicking on the respective icon or using the 
shortcuts. 


A.5.17. Search/filter within your whole project &&@ 


In Omegar«, it is really worthwhile to explore the whole potential of OT’s sophisticated Search feature! 


) Cursors leyistation - Omneget 


Teoxt to sanarely 


J in ource NOT cectorns lequelamon ~| Memornze || 

J\ in rancenen 1") wot 7 Memorite 

J) in votes nor = Memonze 
Sarthe ted ni ali feds ano @ om 


ixpreamon moge = ——- — —- Word rode ——$—$—— = 
2 ect morch Keyword peorch Reguier exprenmoons Cone penatiee [ 2 Suiege Vole words Lemus 


16340>) - 20/02/15 09:21 - Source: <--> - Transiator: <> - Created by: <machame> 

-» ORI). “customs legisiation” means any legal or regulatory provisions applicable in Uw territories of the Parties, governing the import. export and transl of 
goods and their placing under any other customs regime or procedure, including measures of prohibition, restriction and controt 

~> TRA: «Legisiag¢Bo aduaneirar, as disposicdes legisiativas ou regulamentares aplicaveis nos territorios das Partes que regem a importacéo, a exportagéo, o 
transite de mercacorias © 3 Sua sujei¢So a qualquer regime ou procedimento aduanciro, inciuindo medidas de proibi¢do, restri¢do © controto; 


16346>) - 20/02/15 09:24 - Source: <--> - Translator: <> - Created by <machame> 
-» OR|/ “operation im breach of customs legislation” means any violation or attempted violation of customs legislation 
> TRA «Operacdes contrarias a tegisiago acuaneira». todas a6 violagées ou tentativas de viclacdo ca legisiagéo acuancira. 


16352>) - 20/02/15 09.26 - Source: <--> - Translator <> - Created by <macheme> 
~> ORE The Parties shall assist each other, in the areas within their competence, in the manner and under the conditions laid down in this Protocol, to ensure 
the correct application of the customs fegisiation, in particular by preventing, investigating and combating operations in breach of that legislation 
~> TRA: As Partes Gover prestar-se assisténcia mutua, no ambito das suas competéncias, segundo as modalidades e as condi¢ées previstas no presente 
Protecclo, tendo em vista assegurar @ correta aplicacSo da legisla; So aduaneira, nomeadamente atraves da preveng&o, investigacSo e repressdo de operates 
contraérias a essa legisiagéc 


oe +imer 


Screenshot 6 — Search window 


You can search — by exact search, keyword search and regular expressions, strings, whole words and lemmas; use the 
Booleans AND, OR and NOT; by author and translator, by date — in source or/and target segments in your project 
(translated and/or untranslated) and/or in the translation memories and/or in glossaries and/or in notes. 


A new feature in DGT-OT is that for a number of settings, you can also memorize searches to be reused in that session, 
in that particular project or in all projects. This is particularly useful for Regular Expressions. Some are already available 
by default and you can add others if you want. 


Another new feature in DGT-OT is that you can also limit the search to a translation memory or to a folder with several 
translation memories. 


In the same window, you can also filter, by those criteria, segments in your project and have them displayed in the 
Editor pane for editing, by clicking on Filter at the bottom of the Search window. 


™ Filtering in OT really refers to the editing of the searched terms/strings in the project memory and therefore it is — 
obviously — only applicable to searches in the project memory. 


A.5.18. Search in monolingual reference documents B&@ 


Contrary to the public version, in DGT-OT this feature is not integrated in the Search Project window and has a window 
of its own with added features. 


With it, you can search monolingual documents in the formats accepted by OmegaT (e.g. Office and PDF) using the 
Search Directory feature. 


@ this may be very useful — and can save you a lot of time — when you have monolingual or unaligned documents 
(for example, national legislation, standards) that are relevant to your project. 
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A.5.19. Search / Replace B® 


You can do a Replace Interactive (one by one) or a Replace all in all the documents of your project using most of the 
options available in Search Project and “preview” the segments affected before launching the Replace All operation. 


This feature has been improved in DGT-OT with some new options. 


A.5.20. Search and Pre-translate B® 


This is a brand new feature specific to DGT-OT which allows you to search by some criteria and pre-translate the 
resulting segments — either copying source to target or pre-translating them using the external memories matches or 
machine translation output. 


Ge This can be very useful, for example, when you have documents with hundreds or thousands of segments only 
with numbers that you may want to translate in a batch to have them counted as translated in statistics ... and to 
automatically go to the next segment for translation without stopping in those segments. 


™ Please note that in case you have more than one match with the best score, this screen provides no way to decide 
which one will be inserted (it depends on memory ordering, see Section F.4.3. for more details). You can check in the 
screen what will be inserted before confirming. 


A.5.21. Auto-propagation of non-unique segments 


OT automatically detects segments that are repeated 2 or more times in the document(s) in your project and the first 
time you translate one of these occurrences it automatically and instantaneously auto-propagates it — backwards and 
forwards — in all the other non-unique segments in a background operation you don't see. 


By default, those segments are also colour-coded (in grey) to call your attention to them. If the translation is later 
changed in any of the identical segments’ occurrences, all the other segments will be automatically changed without any 
need for search, search/replace on your part... but also without warning! 


A.5.22. Alternative translations and the Multiple Translations pane 


If you want the translation of a particular instance of a non-unique (repeated) segment to be different and not changed 
automatically when any other of its occurrences is changed, you can define it as an alternative translation which will be 
kept unchanged in that particular segment even if any other of the non-unique segments is changed. 


If a segment has alternative (different) translations in a project, they will be displayed in the Multiple Translations pane 
with the indication of the default translation and the alternative translation(s) with the previous/next segments and the 
number of the document in which they occur, making it a sort of “perfect match”. 


A.5.23. Orphan segments 
These are segments that you already translated — and which are therefore in your project memory — but which no 
longer exist in the documents in your project. 


This may happen when you delete — for some reason — a document from your project or when you update a project 
with a new version of one or more documents and delete the previous version. 


These segments will be displayed in the Fuzzy Matches pane and identified as “orphans”. They are displayed first 
according to the match rate. 


ag 
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A.5.24. Footnote segments 


Footnotes are treated as end notes and are always displayed at the end of the relevant document (not at the end of the 
project if it is a multi-document project). 


There is no indication or visible "link" (besides a non-descriptive tag, if you are translating with tags) between them and 
the respective paragraph. However, they will be correctly displayed in the translated documents ... as long as you have 
inserted the respective tag in the relevant segment. 


™ If you forget to insert a footnote tag when you are working with tags (default), that footnote will not be displayed 
in the translated document in its native application. 


However, if you are working with Remove Tags activated — and as in this mode there are absolutely no tags — 
the footnote number will be displayed in the translated document at the end of the paragraph and the text of the 
footnote at the bottom of the relevant page. 


A.5.25. Notes @ 


You may want to write notes while translating regarding a particular segment (for instance, concerning terminology 
problems to solve, colleagues or experts consulted/to consult, mistakes in the original) in the Notes pane when that 
segment is open. 


Segments with notes are highlighted (by default) with a background colour (BifIK) to call your (or the reviser’s) attention 
to them. 


Neither this highlight nor the actual notes will be transferred to the documents in their native applications. However, you 
can “export” all your notes to a file, if you want to discuss them with another colleague/reviser/terminologist. 


You can also identify your notes in a way that allows you to export different kinds of notes. For instance, notes for the 
terminologist or for the reviser. 


A.5.26. Auto-completion @ 


Auto-completion is a brand new Omega feature which allows to complete words from glossaries or auto-text (your own 
list of abbreviations for instance), to add (special) characters and also to insert tags. 


A.5.27. Scripting 


OmegaT has evolved a lot in the last 2 years and there is a very active community of developers contributing to its 
improvement. To facilitate the use of features that are developed around OmegaT there is now the Scripting menu, 
which allows the users to select some applications that perform certain operations not (yet) integrated in the general 
OmegaT version. 


In DGT-OmegaT a few have been selected which are available by default (with shortcuts), but you can also add others 
from the available list. 


& You can even add your own scripts... if you know how! 


A.5.28. Spellchecker @ 


The spellchecker uses the LibreOffice spellchecker and has been complemented by a script — Spellcheck — that 
allows you to correct spelling — and ignore and add words to the spellchecker — in the whole project. 


You can check the spelling in all the documents of your project or in the active document and update your dictionaries of 


learned_words and ignored_words. 
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In this update, contrary to the public OT — in which these “dictionaries” are project-specific — in DGT-OT the 
learned_words and ignored_words “dictionaries” are saved — for each language combination — in a single file in the 
_CONFIG-PERSONAL subfolder of your OmegaT_Projects folder to be used in all your projects. 


These are text only files that you can easily edit if you want to add, words or list of words in batch (provided by the 
Linguistic Coordination of your Language Department, for example) or to delete words. 


A.5.29. Quality assurance ® 


OmegaT — as many open-source applications — relies on the interaction with other applications and in DGT you can 
use XBench which is installed in all DGT computers if you want to do a more in-depth quality check. 


But now, with scripts like QA — Check Rules and QA — Show Same Segments, you can already perform in OmegaT 
a substantial number of quality checks at any stage in the translation process. 


A.5.30. Comments pane 


Some of the file formats, specialized for translation work, for instance PO, allow the inclusion of comments from the 
requester. This way the translator can be provided with the context about the segment to be translated or be given 
instructions about it. This is something that is not used in DGT documents for the moment, as far as | know. 


A.5.31. Merging and splitting segments/segmentation 


OT does not allow easily merging or splitting segments in the Editor. It can be done but it requires some manipulations. 


Segmentation is always a problem although the segmentation rules have been greatly improved to match Euramis/MT. 
But they are not perfect ... namely with poorly formatted original documents. 


But if you just translate the split or merged segment as shown in the Editor, it will be converted correctly in the document 
in its native application (even if it implies changing the order of elements in your translation). 


® Section D.1.17. if you really want to merge or split segments. The most practical way is to change/correct the original 
file and update the project. 


rr) It has an advantage: you can even merge paragraphs if you want! 


A.5.32. Adding formatting to the target segment 


In OmegaT you can omit formatting — with the exception of footnote-related tags — that is present in the source 
segment but you cannot add formatting which does not exist in the source segment. You will have to add that formatting 
in the final translated document in its native application. 


& You can use the Note feature to remind you to add the formatting later by generating a list of format-related 
notes. 
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A.6. Position and size of the panes 


You can resize the different windows/panes as in any other Windows application, by dragging the margins using the 
mouse. 


You can also undock and change the position of the different panes (Editor, Fuzzy Matches, Glossary, etc.), but | 
suggest that, when you start, you accept the defaults. 


If you have changed the OT display and you don’t like it — and you feel at a loss — you can go back to the default 
display by selecting Restore Main Window in the Options menu. 


— | a = 


Minimize Undock Maximize Restore 


Screenshot 7 — Icons to minimize, undock, maximize and restore panes 


Minimizing is done as in Windows by clicking on the respective icon at the top right side or right-clicking on the mouse 
and selecting the minimize option. 


Maximizing is done likewise by clicking on the pane’s name at the bottom of the OT window and by clicking on the 
Restore icon or right-clicking on the mouse and selecting that option. 


If you want to change the position of a pane, just click on the undock icon of the pane you want to move and, clicking on 
the blue header, drag it to the position you want (you will see a greyed area showing where the pane will be positioned). 
Release the mouse button when it is in the position and of the size you want. 
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A.7. Main shortcuts and DGT icons se 


Functions and shortcuts: As in most applications, you can use the menus (with the full features) or the shortcuts (for 
many operations). Below are listed some of the most frequently used. 


® Part Q for a full list. 


DGT icons: DGT IT Unit has added some icons for the most common operations, which are self-explanatory (place the 
cursor over them and a descriptive text will appear). 


A new icon has been added to give direct access to the full-fledged IATE interface. 


| BBS Ml DeAdAVY {7A 49 RF eT O 


123 4 5 6 7 8 9 10 12 12 13 14 1% 16 17 #18 «19 20 21 


[ Ficion [on] Sto 


New project il Anew project is usually created with the DGT-OT Wizard 
Open project 2 Ctri+O Projects are usually opened via the DGT-OT Wizard 


lf you add or delete documents, or if you change your 


F 
AOL ; 3 preferences, in an OT project 
Close project 4 Ctrl+ShifttW To close the project, but not OmegaT. 
To undo the last action, but only in the segment open in the Editor 
Undo last action 5 Ctrl+Z ™ OT has no Undo last action for operations like Replace all 
or any other. 
To redo the last action, but only in the open segment in the Editor 
Redo last action 6 CtrtY = ™® OT has no Undo last action for operations like Replace all 
or any other. 


Search project 7 CtrleE ae terms/strings — with many options — in the whole 


Search and Replace 8 Ctri+K To Search/replace — all or one by one — in the whole project. 
To close and save the segment to the project memory and to open 


Go to previous 


segment 2 poe the previous (translated or untranslated) segment for editing. 
To close and save the segment to the project memory and to open 
Got to next segment 10 Ctrl+N the next (translated or untranslated) segment for editing The 
equivalent of Return. 


Go to next To close and save the segment to the project memory and to open 
11 Ctrl+U ea 
untranslated segment the next untranslated segment for editing. 
Go to the next : To close and save the segment to the project memory and to open 
Ctrl+Shift+U Be 

translated segment the next translated segment for editing. 

Replace with match CtrltR To replace the target segment with the match selected (in bold) in 
the Fuzzy Matches pane. 
To insert in the target segment, at the position of the cursor, the 

DEE ueMey a aah match selected (in bold) in the Fuzzy Matches pane. 

Replace with Machine To copy to the target segment the machine translation output that 

: 14 Ctrl+M ae : 2 
Translation is displayed in the MT pane or to replace what is there. 
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Insert Next Missing Tag 
Insert missing source 15 
tags 


Validate tags 16 


PDockinder ey 
quest 


IATE 


Project files 


Cr 0 


Create Translated 
documents 

Create Current 
Translated Document 


Open project Folder 


Insert glossary entry 


Open writable glossary 


Go To Next/Previous 
Revised Segment 


Spellchecker 
Quality check 


Auto-completion 


Auto-completion — 


cycle 


Ctrl+T 


Ctrl+ Shift+T 


Ctrl+Shift+V 
Ctrl+Shift+D 
Ctrl+Shift+Q 


Ctrl+Shift+L 


Ctrl+Shift+E 


Ctrl+Shift+Q 
Ctri+L 


Ctrl+D 


Ctrl+Shift+D 


Ctrl+Shift+F1 
Ctrl+Shift+G 
Ctrl+Shift+F2 


Ctrl+Shift+X 
Ctrl+Shift+Y 


Ctrl+Shift+F7 
Ctrl+Shift+F3 


Ctrl+space 


Ctri+Page ft | 


eon] Stones] maton 


To insert (the first or) the next missing source tag in the target 
segment at the position of the cursor. 


To insert all the missing source tags in the target segment at the 
position of the cursor. 


To show the list of segments with missing/different/surplus tags in 
the target segment. Clicking on the segment number, OT jumps to 
that segment in the Editor for editing. 


Highlight the reference to search and it opens DocFinder. 
Highlight the term/string to search and it opens Quest 


To open IATE full-fledged interface with all search options and 
allowing the creation of new IATE entries. 


Highlight the term/string to search and it opens Euramis 
full-fledged interface with more search options. 


To quit the project and OmegaT 
To display the list of documents in the project 


To create all the translated documents in their native applications. 
OT stores them in the project \target subfolder 


To create the translated document open in the Editor in its native 
application. OT stores it in the project \target subfolder 


Within OmegaT, to open the project folder in Windows Explorer 

To create a new glossary entry 

To open the glossary in Notepad++ to change or delete entries. 
Used in the revision process for the translator to check the 
changes made by the reviser displaying — in the Fuzzy Matches 
pane — the track-changes in the target segment. 

To detect spelling mistakes in the whole project and to add 
learned/ignore words for that target language for all projects. 

To carry out several levels of quality check in the whole project 

To display and insert Glossary Entries, Auto-Text Entries, Missing 
Tags and characters/symbols from the Character Table 

To cycle between Glossary Entries, Auto-Text Entries, Missing 
Tags, Character Table 


Besides the editing functions in OT, you can also edit your segments using the general shortcuts below. These only 
apply to the segment open for editing in the Editor pane. 


|_Function_| Shortcut 
[Select all | 


Copy text 


Paste text 


Delete text 


Ctril+A 
Ctri+C 


Ctri+V 
Ctrl+X 


Selects all the text in the open segment and highlights it in blue 


Copies the selected text to be afterwards pasted into another segment (as OT 
“remembers” it even after the segment is closed) or into another application 


Pastes the copied text into another open segment in the Editor or into another application. 
Deletes the highlighted words in the segment open in the Editor. 


™ Take into consideration that the highlighted text is also deleted from the project 
memory when you validate that segment, i.e., when you open that segment again you will 
not get back your previous translation. 
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A.8. Translating in DGT-OT — in a nutshell ! 


This Section gives you a quick overview of the translation and revising process using DGT-OmegaT and its Wizard in 
order to make this more detailed Guide understandable on its own. 


Here is used an example of a standard workflow — which will probably cover the majority of your needs — with a 
standard multi-document project. 


® Relevant sections form detailed information (Detailed Index with clickable links at the end of this Guide). 


The starting point to create (or update) a project with one or several documents (with the same or different dossier 
numbers) is always to use the Local copies option in Tradesk to copy to your computer the original documents to be 
included in the project. 


By default, they are copied to the C:\Users\{your login}\AppData\lLocal\Local Documents — no 
backup\DGT\Dossiers folder. 


To manage your project, you can click on the buttons or use the shortcuts (ALT + character underlined, for example 
ALT+A). 


If the first character of each option is not underlined, just click on Alt to activate the shortcuts. 


As OmegaT doesn't accept the Office 2003 formats, if you have an “old” document still in that format, either use the ORC 
document in Tradesk (if it is a Word document) or open that/those document(s) in their native application(s) and do a 
Save as in the 2010 Office format (xlsx, pptx) before creating the project. 


A.8.1. Creating or updating a project Be 


The process to create and update a project is very similar as shown below. 


A.8.1.1. Creating a new project 


1—_ In Tradesk, do a Local copy of the document(s) to include in your project which will be automatically copied to 
the Dossiers folder (Local Documents — no backup — DGT — Dossiers ) in your computer. 


2— Click on the DGT-OT Wizard icon — ——— — in your desktop (automatically created in the installation of OT) to 
open it. 


If it is not your first project, click on Clear before starting the creation process. 
3— _ Select the source and target languages from the dropdown menu (if not already defined). 


4— Click on Add — which by default will open a Windows Explorer window in the Dossiers folder — to select the 
original document(s) previously copied to your computer. 


You have to select them one by one, even if they are in the same folder. If they are in different folders, navigate 
the folder structure to reach and select all the original documents. 


5— Inthe Windows Explorer window, click on Cancel or press ESC when you have finished selecting the documents. 


6— By default, the DGT-OT Wizard will give to the new project the number of the first document you added to the list, 
but you can change it — in the Project field — writing a name meaningful to you. 


7— _ Anextraction from IATE will be included in your project unless you untick the IATE box. 
8— Click on Create and the DGT-OT Wizard will create the OmegaT project. 


9— Click on Open (which is displayed in green when the project has been created) and it will open that project in 
Omega’. 
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A.8.1.2. Updating a project 

To update a project with a new original document or a new version of an original already in the project: 

a— Close Omegat if it is open. 

b— _ Inthe DGT-OT Wizard, check that the project you want to update is the active project in the DGT-OT Wizard. 
If not, click on Select to select that project and make it the active project. 

c— Repeat steps 1, 4 and 5 of the Section A.8.1.1 above. 

d— _ Ifyou are updating the project with a new version of a document already in the project: 


i) Click on Browse, 
ii) Select the \source folder and delete the document(s) you will be replacing with a new version, 
ili) Close the Windows Explorer window. 


e@— Click on Update and the DGT-OT Wizard will update the active project with the new document(s), wiping useless 
tags from Word documents, if any, copying the translation memories available in Tradesk and updating the IATE 
extraction. 


f— — Click on Open and it will open that project in OmegaT. 


A.8.2. Translating 


1— OT starts by displaying the list of the documents in your project. Click on Close to accept the opening of the first 
segment of the (first) document. You are now ready to start the translation of your first project. 


If you want to translate another document in a multi-document project, just press Ctr/+L (or select Project Files 
in the Project menu) and the list will be displayed again and you can select another document. 


You can also change here the order in which OT displays the documents in the Editor by highlighting the name of 
a document and clicking on Move First, Move Up, Move Down or Move Last. 


If you close and reopen that project, OT “remembers” the last segment you edited and will open that segment for 
editing (instead of the first segment of the first document). 


2— To see the statistics of your project before beginning your translation, select — in the Tools menu — the option 
you want: Statistics, Match Statistics or Match Statistics per file. Those statistics will be stored in the project 
lomegat subfolder. 


ee Getting Statistics is a fast process, but Match Statistics (per file) can take a while if you have a really big 
project (hundreds of pages). The good thing is that you can do it any time! 


3—_ There are some preferences that you may want to change right away: 
Vv Menu: Options — Font: 
Font type and size by default: Font — Dialog; size — 18 
Vv Menu: View: 
How the segments are displayed/highlighted (only target, also source, identification, etc.) 
v Menu: Options — View Options: 
Include the first non-unique segment when marking non-unique menus: By default not activated 


v Menu Options — Editing Behaviour Options: 
Minimal similarity (of automatic insertion of matches): by default 80% 
Stop at segments with multiple translations: by default not activated 
Go to Next Untranslated Segment stops when there is at least one alternative translation: by default not activated 
Validate tags when leaving a segment: by default not activated 
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4— OT displays the first segment to be translated and, by default, if there is a Euramis match above 80% (or a 
different threshold of your choosing), it will be automatically inserted in the segment open for translation. 


5— If you want to use Machine Translation (which is displayed in the MT pane) even when a Euramis match has 
been automatically inserted in the target segment, just press Ctri+M to replace the text there by the MT output. 


If there is no match within the defined threshold, OT will automatically insert the MT output (default). You can 
deactivate the automatic insertion in Options — Editing Behaviour, by unticking Insert Machine Translation. 


It is here that you can also select the Minimal Similarity threshold for the automatic insertion of Fuzzy Matches. 


6— OT will display matches in the Fuzzy Matches pane up to the number of matches defined in the Options — 
External TMXs options menu (default: 10). You can change this number to more or less (up to 50 matches). 


& OT will display matches even below the minimal similarity threshold defined the Editing Behaviour menu up to 
the number of matches defined. This may be useful sometimes as it may allow you to see matches at 
sub-segment level. 


To insert in the target segment a fuzzy match other than the first (higher) match displayed in bold in the Fuzzy 
Matches pane, position the cursor on the fuzzy match segment you want to use, double click on it (it will turn to 
bold as the selected segment) and use the shortcuts Ctrl+R or Ctrl+. 


You can also right click on the mouse and choose the option you want from the drop down menu: To Insert 
Match into Translation or To Replace Translation with Match or press Ctrl+{number of fuzzy match} (only 
for the 5 first matches) followed by Ctrl+R or Ctrl+{number of fuzzy match} and clicking on the Replace with 
Match icon. 


7—_ Translate/correct your first segment and press Enter to validate it — saving it to your project memory — and to 
open the next segment for translation. 


8— If you want to go to another segment (far) above or below, just scroll/go to that segment and double click on it to 
open it for translation. OT will save the previous segment to the project memory and open the new selected 
segment for editing. 


9— You can copy/paste, drag/drop between the Fuzzy Matches, Search and Editor panes and from applications 
outside Omega’. 


10 — To access DocFinder and Quest or to access Euramis and IATE directly, just highlight the term/string/OJ 
reference you want to search and click on the icons or use the shortcuts (Ctir+Shift+F, Ctlr+Shift+Q, 
Ctlr+Shift+E, Ctlr+Shift+L, respectively). 


11 — To insert a tag, position the cursor where you want the tag to be inserted and press Ctrl+T. OT will insert the first 
or next missing tag in that position. You can also right-click the mouse and choose the tag you want from the 
dropdown menu. 


Other options are: pressing Ctrl+Space to use the Auto-Completion feature to choose the tags in the dropdown 
menu (this is especially interesting for paired tags (bold, underline, italics)); or clicking on Icon 15 — Insert 
Missing Source Tags — to insert all missing tags at the position of the cursor; or copy/paste or drag and drop 
the tag(s) from the source segment. 


If, in the open target segment in the Editor you have a match with tags and you want to get rid of all the tags for 
some reason, press Ctrl+Shift+F5 (or select Strip tags in the Tools menu) to have the target segment cleaned 
of tags so that you can edit a “clean” segment ... probably to add different tags. 


12 — To search terms/strings in the project memory and/or the retrievals/reference translation memories and/or the 
glossaries and/or the notes, highlight them in the Editor pane, press Ctrl+F and press Enter to accept the 
defaults — or select the settings you want — in the Search window. 


OT will search for terms according to those settings. There are many options really worth exploring! 


If the search results are from your project memory, you can click on the number at the beginning of each segment 
and OT will open that segment in the Editor for editing. 


———————e 
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13— You can Search / Replace all or one by one (Replace Interactive) by pressing Ctrl+K or clicking on Search and 
Replace in the Edit menu. You also have some options. 


14— To create a glossary entry, highlight the source term you want, press Ctrl+Shift+G (or go to the Edit menu and 
click on Create Glossary Entry) and fill in at least the first 2 of the 3 fields available: source term and target term; 
the 3” field is optional and you can add there whatever (amount of) information you want. 


If it is the first entry you create in that project — and you have no previous writable glossary in the \glossary 
folder of your project — OT will automatically create a glossary for the project. 


If you want to delete or change an entry in your writable glossary, press Ctrl+Shift+F2 (or select that option in the 
Tools menu) to open the glossary in Notepad++ for editing. 


15 — The terms/strings with a blue linear and bold underline in the open segment in the Editor mean that there is an 
entry in one of the glossaries in the \glossary folder of your project (which are displayed in the Glossary pane). 
By right-clicking the mouse, the translation(s) of that term/string will be displayed in a dropdown menu 
(TransTips). 


You can select to insert a particular translation at the position of the cursor in your target segment by clicking on 
it. The terms displayed in bold in the Glossary pane are from the writable glossary. 


16— To use the Auto-Completion feature, press Ctrl+space and cycle through the options by repeatedly pressing it 
or pressing Ctrl+Page Down/Page Up to select the option you want. 


To add a new entry for auto-text (abbreviation), select Options — Auto-completion — Auto-text and add your 
new entry. 


To add a character to the character table, select Options — Auto-completion — Character table. 


17 — Alt Codes: For a non-breaking space type Alt 255 or Alt 0160, for a non-breaking hyphen Alt 0173, for a n-dash Alt 
0150, for m-dash Alt 0151 and for quotation dash/horizontal bar Alt 196 (for more, see List of Alt Key Code Symbols 
and Characters; AINSI Codes). 


You can add these to the Auto-completion — Character table or to the Auto-text list to easily insert them. 


18 — After finishing your translation, or at any time during the translation process, click on the icon 16 (Validate tags) 
or press Ctrl+Shift+V and check if OT detected anything wrong/missing. 


By clicking on the number of the segment on the left in the window that is displayed, OT will jump to the segment 
in question and open it in the Editor pane and you can correct it if needed. 


The Validate tags check is sometimes overcautious. There are tags that you may ignore... but others not! 


If there is a tag at the very beginning or end of a segment, or tags before and after a full stop in segments that 
should have been split, don’t even think of what they mean, just insert them, in the same position, in the target 
segment or otherwise it may happen that your translated documents will be corrupted. 


But, don’t worry, just correct the problem and recreate the documents and hopefully there will be no problem. 


Also if, by mistake, you insert the same tag twice in the target segment, it may happen that your document will not 
be correctly generated in its native application. 


Concerning paired tags — e.g. for italics, bold, underline — if you insert only one of them, the formatting in that 
sentence will probably be defective. However, if you do not insert both, the only problem is that the particular 
word/string will not have any formatting when converted to the document's native application. 


\ oncerning footnotes, if you do not insert the tag in the target segment, the footnote will not appear in the 
ae g footnotes, if you do not t the tag in the target segment, the footnote will not app th 
translated document in its native application, although you have translated it. So, be carefull 


However, if you are working with Remove Tags activated — and there you obviously cannot insert tags — the 
footnotes will be in the right place at the bottom of the right page and the number of the footnote will be at the end 
of the right paragraph ... which may not be the right position. You will have to check and, if necessary, change it 
in the translated document in its native application. 
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19— You can do the Spellchecking in your whole project (or in the document you are working on) by clicking on 
Ctrl+Shift+F7 or selecting it in the Tools menu. 


Here you can also add learned_words or ignored_words to your dictionary. Or you can do it in the Editor — 
one by one — by right clicking on the mouse and selecting Ignore all or Add to dictionary. 


20— You can do a quality check of your project — QA Check Rules — by clicking on Ctrl+Shift+F3 or selecting it in 
the Tools menu. You can choose the type of “errors” you want to detect by ticking/unticking each box. 


21— To create the translated document(s) in their native application(s), press Ctrl+D to generate all the target 
documents or Ctrl+Shift+D to generate only the document you are working on. They are generated in the project 
\target subfolder. 


™ Don't forget to check that you have no previous target documents opened in their native applications as in that 
case OT will not be able to generate the updated target documents in the \target subfolder. 


& It is better to do Validate tags before creating one or more translated documents. But if you don’t, OT will 
generate them — even if without some formatting — except if essential tags are missing (See point 18 above). If 
So, correct them and generate the translated document(s) again. 


22 — If you want to view the document you are currently translating press Ctrl+G (or select View target file in the 
Project menu) and OT will generate the translated document and display it in its native application. 


If you have one or more translated documents open in its/their native application(s), the operation cannot be 
completed. So, close all translated documents in their native applications before doing it. 


23 — If you want to view — or copy to another location for editing — any translated document(s) in your project, open 
the project folder (Ctrl+Shift+F1 or select that option from the Tools menu), select the \target subfolder and 
there you have the translated document(s) you generated in its/their native application(s). 


You can generate the formatted document(s) as many times as you want, taking into consideration that any 
changes you make directly (formatting or content) in the generated target document(s) will not be transferred to 
the OmegaT project memory. 


24— Every time you repeat the Create (Current) Translated Document(s) command, OT will delete all the 
documents (if any) in the \target subfolder and generate the (updated) translated document(s). 


@ if you are, for example, working in a project with a (large) number of documents and/or with different deadlines — 
and you make changes in it/them directly — it may be “wise” to create a new subfolder in your project and save 
your document(s) to that subfolder or Upload it to Tradesk so that they are not replaced by any subsequent 
creation of translated documents. 


25 — To copy the translated document(s) to Tradesk — at any moment during the translation or to be sent to the 
requester — or to send translated document memories to Euramis, @> Sections A.8.3. and A.8.4. below. 
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A.8.3. Sending translated documents to Tradesk 


If you want to copy the translated documents (even during the translation process) to make them available to everybody 
in Tradesk or to finalize them and send them to the requester: 


1— _ Check that you have no translated documents from that project open in their native applications. 
2— In OT, do Create (Current) Translated Document(s) (Ctrl+(Shift+)D). 


3— _ In Tradesk, Upload the document(s) you want from the project \target subfolder (or from any other subfolder if 
you have copied it/them to another folder). 


A.8.4. Sending translated document memories to Euramis ®&® 
If the translation is finished (revision included, if any) using OmegaT, you should send the individual memory(ies) to 
Euramis. For the moment, you must do it manually: 


1— Generate the memory(ies) to be sent to Euramis by clicking on Ctri+Shift+F8 (or Tools — Scripting — Create 
Euramis Export) to generate individual memories of your documents with the attributes required by Euramis. 


Those memories are saved in the \euramis subfolder of your project. 
2— Open the Euramis interface and select the individual memory(ies) to be sent to Euramis in the abovementioned 
subfolder. 
@ To see to which Euramis Memory are your particular document(s) to be saved, when in doubt, see the Euramis 
— Statistics & Managers — List to see Where to Save a document from a given DG, service or Cabinet 


3— You will receive an email with the confirmation that the memory of your document(s) has been saved. 
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A.9. Revising in DGT-OT — in a nutshell ! 


If your reviser wants to use OT to revise your document(s) — and if you have done all your translation work using OT, of 
course — it can be done but with some manipulations on your part as there is currently no automated revision workflow. 


Take into consideration that, in OT, there are no track-changes in the (open or closed) target segment(s) displayed in the 
Editor. There are only track-changes in the segments displayed in the Fuzzy Matches pane which, in the revision stage, 
are from the “draft” translation that is being revised and possibly from other external memories. 


As it is possible to display track-changes in the target segments too, the translator who is checking the changes made by 
the reviser — to accept them or not — can see the track-changes in the target segment in the Fuzzy Matches pane for the 
segment that is open in the Editor. 


Also take into consideration that, if you — as the translator — have the last word, you cannot accept some of the reviser’s 
changes and reject others in a particular segment. You can either accept all the changes made by the reviser or change 
manually the ones you don't accept. Or you can also reinsert your initial translation (from the Fuzzy Matches pane) and do 
the partial changes to your initial translation manually ... depending on what involves less typing work. 


& On the positive side, as the track-changes are displayed in the segment in the Fuzzy Matches pane and not in the 
Editor, you don't have to do anything if you accept all the changes the reviser has made in a particular segment — 
which is probably the most common situation — as the text is always "clean". 


Here is presented a standard workflow of a single or multi-document project that is translated by 1 translator and revised by 
1 reviser and the whole project is sent for revision at the same time... which is the most frequent situation. 


“Part N for a detailed explanation of the process and variants for complex/large projects involving 2 or more translators, 
and/or more than 1 reviser and/or a number of new versions while the project is being revised. 


A.9.1. The translator prepares the project for revision 


In Windows Explorer: 
1— Select the OmegaT_Projects folder which is under C:\Users\{your login}\AppData\Local\DGT. 


2— Do acopy of the project you want to send for revision to that same folder by highlighting the project folder name 
and pressing Ctrl+C and Ctrl+V. 


3— Rename the new project folder thus created (which Windows Explorer automatically renamed 
{name-of-the-project} copy). 
& You can name it, for instance {name-of-the-project}-FOR-REVISION. 


This way, your original project will remain intact and you can always reuse it if you want. 


In Windows Explorer, in the copy of the project for revision: 
4— Open the project folder. 


5— In the \tm folder of the project, create a new subfolder — for example called 0-DRAFT-translation — thereby 
giving it the maximum priority in the display in the Fuzzy Matches pane when the reviser does its revision work. 


6— Doacopy of the project memory (project_save.tmx file) in the project \omegat subfolder to this new subfolder. 


™ Don't delete the project memory from the \omegat subfolder. 


7— Copy the project to a location on a server that you have agreed with your reviser or which is the location used in 
your Unit/Language Department to exchange projects (or copy it to a USB key). 


8— Inform the reviser that the project is ready for revision and indicate the location. 
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A.9.2. The reviser does the revision 


1— In Windows Explorer, copy to the OmegaT_Projects folder the project for revision prepared by the translator that 
was copied to the server (or copy it from the USB key provided). 


2— Open the project as usual via the DGT-OT Wizard to have automatic backups every 10 minutes. 


3— _ The process is the same as in translation mode. You can view the segments in the Editor — with Display 
Source Segments activated in the View menu (default) — and only open the ones you want to change. 


You can also open all the segments and validate them with Return. In that case, only the segments you change 
will be identified with your login. The others will remain with the login of the translator. 


4— Inthe Fuzzy Matches pane you will always be able to see the segment open in the Editor as translated by the 
translator, identified — in the case of the example above — by 0-DRAFT-translation (name of the subfolder). 


@ As OmegaT only keeps the last modified translation in the project memory — and therefore eliminates from it the 
translator's target segment if you make any changes to it and validate it — this way you can always recheck later 
the segments you revised against the initial translation displayed in the Fuzzy Matches pane. 


5— If you want to see the original document, press Ctr/+H to view it in its native application. If you want to see the 
current translated document in its native application, press CtrltG any time during the revision. If you have 
already made changes, the translated document will already have those changes. 


6— You can see the Notes that the translator may have written and you can “communicate” with the translator writing 
your comment/answer or creating new notes. 


7— _ Inthe Glossary pane, you will see the entries from the glossaries in the project — or at least from the IATE 
extraction automatically created with the project. 


If the translator had a writable glossary, you will see the entries from that glossary in bold in the Glossary pane 
and you can also add new entries to the writable glossary or edit the glossary — to change one or more entries 
— by pressing Ctrl+Shift+F2 and editing it in Notepad++. 


8— If you want to check the changes you made in the project documents with track-changes, you can — in Options 
— External TMX Options menu — tick the box View diff in target. 


Of course, you will have to open the segment to see the track-changes displayed in the target segment(s) in the 
Fuzzy Matches pane. 

If you do that, don’t forget — after finishing the revision — to change again this setting if you are afterwards going 
to do translation (and not revision) work. 


9— If, at the end of the revision, you want to quickly check the changes you have made, just filter those segments by 
doing a Search by Regular Expressions, writing a dot (“.”) in the field In translation and your login in the field 
Translator. 


Only the segments you have changed will be displayed in the Search window and you can view them in a batch. 
You can also filter them for editing in the Editor or just click on the number of a particular segment to open it in 
the Editor. 
Don't forget to tick the box in those 2 fields (and to untick them when you no longer need it). 
10 — When you finish the revision, rename the project (for instance, {name-of-the-project-REVISED). 


11 — If you have the last word and you release the translation, just finalize your project sending translated documents 
to Tradesk and translated document memories to Euramis. 


® Sections A.8.3. and A.8.4. 


12 — If the revised translation is to be finalized by the translator — or if you want to give the revised translation to the 
translator for information purposes only — copy the project to the same server location and inform the translator. 


|_| 
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A.9.3. The translator finalizes the project 

If you are the translator and you have the last word: 

1— Copy to your computer the revised project from the server location. 

2— Open the project as usual via the DGT-OmegaT Wizard to have automatic backups. 


3— _ To easily see the segments that have been changed by the reviser, in the View menu activate Mark Revised 
Segments to have those segments marked with a red background. You can scroll the segments in the Editor to 
see those that were changed. 


4— To see the changes made by the reviser segment by segment, use the feature Go To Next/Previous Revised 
Segment (View menu) or use the shortcuts Ctrl+Shift+X and Ctrl+Shift+Y, respectively. 


OT goes to and opens for editing the next or previous segment that is identified with the login of another user, in 
this case the reviser’s login. 


5— Alternatively, if you want to quickly check the changes made by the reviser without opening all the changed 
segments in the Editor, just filter those segments by doing a Search by Regular Expressions, writing a dot (“.”) 
in the field In translation and the reviser’s login in the field Translator. 

™ Don't forget to tick the box in those 2 fields (and to untick them when you no longer need it). 


Only the segments the reviser changed will be displayed in the Search window and you can quickly check them. 
If you want, just click on the number of a particular segment to open it in the Editor ... or filter them all for editing 
in the Editor. 


6— You can accept the changes made by the reviser, reinsert your own translation from the Fuzzy Matches pane — 
as your translation is displayed first — and/or do partial changes to any of them. 


7—_ Finalize your project sending translated documents to Tradesk and translated document memories to Euramis. 


® Sections A.8.3. and A.8.4. 
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A.10. Non-unique segments / auto-propagation / 
alternative and default translation 


For projects in which there is a substantial number of repeated (non-unique) segments, auto-propagation can be a very 
interesting feature. 


However, it is important to understand how it works to take the best advantage of it as, in OmegarT, auto-propagation is 
done in the whole project — both backwards and forwards — in a background operation that you don’t see ... but also 
without any “warning”! 


As in DGT we frequently have projects with a significant number of repetitions, this OT feature is explained right away in 
this Guide so that you can fully benefit from it ... ina safe way! 


& Auto-propagation may seem a bit confusing at first, but it is a feature that is really important! 


By default, the DGT-OT Wizard creates the projects with auto-propagation activated. If you don’t want to have it in your 
project, in the Project — Properties — Edit Project menu, untick the Auto-propagation of translations option. 


Non-unique segments are identical source segments (100% including tags) that are repeated in a project — be it within 
a document and/or between documents in the case of multi-document projects. 


[<t0/>legal name of the audit firm<t1/>] 

<segment 1514 “TRA* > 

[<t0/>designa¢ao juridics ds empresa de auditoriast1/>] 
<end segment> 


[<t0/>[name and function of an authorised representative] 


nome & TunNCaO ce um representantes auiorizad 


<dd Month yyyy>,<Signature of the Auditor> 


CS dsigi- seINawle OO suche 


Procedures performed by the Auditor 
Procedimentos executados pelo auditor 


Screenshot 8 — Non-unique segments in the whole project displayed greyed — Unique segments in the 
whole project displayed in black 


When translated in the first occurrence, these segments have, by default, the status of default translation and 
auto-propagation — i.e. translation — is automatically done in the whole project changing the status of the other 
non-unique segments from non-translated to translated and all these non-unique segments are counted as translated 
in the Statistics. 


If the translator defines, in a particular segment, a translation as an alternative translation — thereby dissociating that 
particular segment from the other identical source segments in the project — the translation of that particular segment 
will not be changed if any of the other occurrences is changed. 


Those segments are marked with a as being non-unique, independently of being the default translation or 
an alternative translation. 


1 | 
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You may choose not to have the first occurrence marked as a non-unique segment and only have them marked from the 
second occurrence onwards (default) or have it greyed in the first occurrence too. 


It depends on your preferences or the phase in your work: you may want to be alerted to the fact that a particular 
segment is repeated in the text if you want to see the context of the other occurrences or you may prefer to see it just as 
anew segment. 


You can change this easily in the Options — View Options menu by ticking Include the first non-unique segment 
when marking non-unique segments any time you want. 


JE View Options 


|\V| Display all source segments in bold 


|" | Include the first non-unique segment when marking non-unique segments 


Screenshot 9 — View Options menu 


An alternative translation of a segment is defined by right-clicking on the mouse and selecting the option Create 
alternative translation from the drop-down menu displayed. 


In the example below of a listing of products quoting the EU Combined Nomenclature, “- — — — Other” can be 
translated in Portuguese in 4 ways depending on the context of a particular CN heading. 

In the case of this particular segment, the default translation is “- — — — Outros” as displayed at the top of the 
Multiple Translations pane, and there are 3 alternative translations — “- — — — Outro”, “- — — — Outras”, “- — — 


— Outra” (gender and number differences in PT) displayed with the number of the document in which they appear and 
the previous and next segments. 


Project Edit GoTo View Toots Options Help 
BwWem oe AC avy. 7849 2S'0 
Stat ~ LARK 2054 G083) 08 C2 HT -TRACO.D0CK - oO Moaagte Tranaatars 
_ ++ Of cotton: * | --+-- Outros 
-- De algoddo: 


-+- Bib and brace overalls: 
--- Jardineiras 


---- Outre 
<ELARG-2014-80031-00-03-PT-TRA-00.DOCKX> 
(2905 44 19... <...> --- Othe...) 


6203 42 59 
6203 42 59 


~~-~ Outro 
<ELARG-2014-80031-00-02-PT-TRA-00.DOCKX> 
(2009 29 19... <...> --- Ofa...) 

Transiation fast modified by machame on 04-Feb-2015 at 11:5. 
46 ~~-- Outras 
<ELARG-2014-80031-00-02-PT-TRA-00._ DOCK> 
(8207 70 37... <...> 8207 70 90...) 


| ---- Other 
<segment € 
---- Outros 
<end segnr ---~ Outras 
<ELARG-2014-80031-00-02-PT-TRA-00.DOCX> 
(8467 29 85... <__.> - Other to...) 


Add glossary entry 


6204 Set empty transiation 
6204 Remove tracalabon 
Register identical Trandianon 

Women's o r t kets, Diazers, 
dresses, sk Create Alternative Transiaticn bib and Drace 
overalls, breeches and shorts (other than swimwear): 
Fatos de saia-casaco, conjuntos. casacos. vestidos. saias, saias- 
calgas, calcas, jardineiras, caigas curtas @ cal¢des (shorts) 

| (exceto de banho), de uso feminino: 


- Ensembles: 
~ Conjuntes 


6204 21 00 
6204 21 00 


++ Of wool or fine animal hair 
) Octenery Comment: Machime Transition Glossary Motes fuzzy Matches 


Prevect etoseres on 1) 08 
7 


---- Outro 
<ELARG-2014-8003 1-00-02-PT-TRA-00. DOCX> 
(2009 49 19... <_.>---Ofa..) 


---- Outras 
<ELARG-2014-80031-00-02-PT-TRA-00,D0CX> 
(2008 40 $9... <...> --- Cont...) 


-~-~ Outras 
<ELARG-2014-80031-00-02-PT-TRA-00.DOCKX> 
(1602 39 29... <._..> 1602 50) 


~--- Outre 
<ELARG-2014-80031-00-02-PT-TRA-00,DOCX> 
(4407 10 98... <...> - Of tropi...) 


[Pasaypas2 (32041/22313, 18254)] (33714) ) 


Screenshot 10 — Non-unique segment with alternative translations 
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If the same sequence of previous, non-unique and next segments matches any of the combinations already defined as 
alternative translations, the default translation will not be auto-propagated. 


By default in DGT-OT, this applies to all the documents in the project which have that particular combination of segments 
considering that we have frequently new versions of documents which, therefore, will have a different number but in 
which you might want multiple translations to be recognised. 


In the case of a new version of a document previously in the project (and the previous version is deleted from the 
project), and if there are alternative translations, they will be correctly inserted in your new text if the new non-unique 
segments are preceded and followed by the same segments (a “perfect match’). 


However, if you want you can define that alternative translations apply only to segments in the document where they 
were first defined by unticking the option Ignore file context when identifying segments with alternative translations 
in the Options — File Filters menu. 


You can also change your mind at any time, and turn an alternative translation into a default translation by right-clicking 
on the mouse and selecting the option Use as default translation from the drop-down menu displayed. The new default 
translation will be auto-propagated to all the non-unique segments that have no alternative translation defined. 


™ So be careful and take this into consideration if you change your mind! 
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A.11. Editing — Identification of segments 


Omega? relies basically on a system of highlights and text colours to indicate segment status and other information so it 
may be worthwhile to understand how it works to take the best advantage of it. 


In the Editor, by default, the open segment displayed for translation is identified with the match rate (with and without 
formatting penalty), if there is a match from the external translation memories, or with MT if it is the output from machine 
translation. 


When you translate and validate a segment, it is saved to the project_save.tmx memory in the project \omegat 
subfolder. If you open it again, that segment is, by default, identified with your login, date and hour displayed above the 
open segment. 


When you change the translation of a segment, the previous translation is discarded, which means that after validating 
the changed translation you cannot go back to a previous version of that target segment. 


All the segments in your project memory are identified with your login with the exception of pre-translated segments, if 
any. 
If you use Pre-Translation — by adding a memory to the project tm\auto subfolder — your project memory will be 


“auto-populated” with 100% match segments (including formatting) coming from the memory(ies) you copied to that 
subfolder. 


Those segments are identified in your project memory with the login recorded in the translation memory used for 
pre-translation — which may be the login of another translator — and are displayed with an orange background (default) 
in the Editor to indicate that they were pre-translated (“auto-populated”). 


Those segments will remain so identified in your project memory unless you open and change them. If you don’t change 
them, the segment identification will remain unchanged too. 


When revision is performed in Omegat, the segments changed by the reviser will have the login of the reviser, of course. 


In the Fuzzy Matches pane, by default, each segment is identified with the name of the \tm subfolder where the file with 
that segment is stored (if any), the name of the tmx file, the date and hour of creation, the name of the source document, 
the match rate (with and without tags) and the login of the translator (if any). 


~® Part O if you want to the DGT attributes and the way they are displayed. 


By default, the match segments are displayed with track changes in the source segments. But track-changes can also 
be displayed in the target segments. This feature is used in the finalization stage when the translator — if s/he has the 
last word — checks the changes made by the reviser in order to accept them or not. 


As you have a number of options — and because it is so much simpler to show than to explain — here is an overview of 
what it looks like. Some of these features are DGT-specific and are marked with the DGT logo. 


e As you can see in the screenshots below, working with DGT-OT can be quite colourful! 


View Preferences — Default — Open segment already translated identified with the translator's login and date. 
Display source segments (with a blue background) activated and Mark translated segments inactivated. 


Translation last modified by machame on 11-Mar-2015 at 09:52:51 
ARTICLE 112 

<segment 17790 ““TRA™ >= 

ARTIGO 112.29 

<end segment> 


information and communication 
Informagaoc e comunicagao 


The Parties shall take the measures necessary to stimulate the mutual exchange of information. 
As Partes adotam as medidas necessérias para incentivar o intercambio mutuo de informacées. 
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View Preferences — Segment already translated displayed with Mark translated segments (with a light green 
background) activated. Translator and date identification for the open (and already translated) segments activated 


| Translation last modified by machame on 11-Mar-2015 at 09:52:51 
ARTICLE 112 

<segment 17790 ““TRA™ > 

|ARTIGO 112.° 

<end segment> 


\Information and communication 
informac&o e comunicacaéo 


| The Parties shall take the measures necessary to stimulate the mutual exchange of information. 
|As Partes adotam as medidas necessarias para incentivar o intercambio mutuo de informacsdes. 


View Preferences — Mark translated segments (with a light green background) and Mark non-translated segments 
(with a light yellow background) activated. Display Source Segments activated. 


Editor - ELARG-2014-80031-02-01-PT-TRA-00.DOCX 


ARTICLE 112 
ARTIGO 112.9 


Information and communication 


The Parties shall take the measures necessary to stimulate the mutual exchange of information. __ 
As Partes adotam as medidas necessarias para incentivar o intercambio mutuo de informacées. 


vr 


View Preferences — Mark translated segments and Mark non-translated segments activated. Display Source 
Segments inactivated — Translator and date identification for all translated segments activated. 


Translation last modified by machame on 11-Mar-2015 at 09:52:51 
ARTIGO 112.° 


Information and communication 


Translation last modified by machame on 11-Mar-2015 at 10:05:07 
As Partes adotam as medidas necessarias para incentivar o intercambio mutuo de informagdes. 


es) 
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View — Preferences — Default — Non-unique segments greyed. Non-unique segments can be greyed in all 
occurrences or only from the second occurrence onwards depending on the option selected in Options — View — 
Include the first non-unique segment when marking non-unique segments (unticked by default) 


Transiation last modified by machame on 02-Feb-2015 at 17'40;10 
LEGAL ELEMENTS OF THE PROPOSAL 


ELEMENTOS JURIDICOS DA PROPOSTA 


The legal basis for the signature of this agreement Is Article 217, in conjunction with Article 218(5) and the second subparagraph of Article 
218(8) of the Treaty on the Functioning of the European Union. 


Following the judgement in case C-377/12 Commission v. Council, the Commission considers that the Stabilisation and Association 
Agreement can be signed in one single act. 


A separate legal instrument applies to the European Atomic Energy Community 


Proposal fora 


View — Preferences — Segments with notes highlighted in pink (when not open for editing) (default) 


Notes 


Check with terminologist 


Editor - ELARG-2014-80031-02-01-PT-TRA-00.DOCX 


- - Cut corduroy 
- - Veludos e pelucias obtidos por trama, cortados, canelados (cételés) 


View — Preferences — Default — Mark Auto-populated Segments activated — The segments auto-populated 
(pre-translated) marked with an orange background when not open (default). The translator’ login from the reference 
document used for pre-translation (in this case costami) will remain unchanged if the translator (in this case machame) 
does not change that particular segment. 


Uditor - KTO-2012-00050-00-00-"T-TRA-00,00CK 


Translation last modified by costam/ on 20-Nov-2013 at 13:21:46 
THE EVROPEAN COMMISSION, 

<segment 0004 “*TRA™ > 

A COMISSAO EUROPEIA, 

<end segment> 


Having regard to the Treaty on the Functioning of the European Union, 
Tendo em conta o Tratado sobre o Funcionamento da Unido Europeia, 


Having regard to Council Regulation (EC) No 723/2009 of 25 June 2009 on the Community legal framework for a 
European Research Infrastructure Consortium (ERIC) (<'0/>), and in particular point (a) of Article 6(1) thereof, 


Whereas: 
Considerando o seguinte: 


| 
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Fuzzy Match — 100% matches highlighted with a green background in the target segment in the Editor 
together with the number of the segment and the 3 match rates (with and without formatting/numbers) 


Fozzy Matches 
|2-RETRIEVALSI\RTD-2013-80050-00-00-EN-ORI-00_EN-PT-RET.tmx (+14 more) 27/01/12 16:06 
Match: <100/100/100%> - Source: <RTD-2011-800750100> - Translator: <machame> 
(1) -> ORI DIFF: GENERAL PROVISIONS 
TM TRA: DISPOSICOES GERAIS 


Editor - RTD-2013-80080-00-00-FT-1RA-00.DOCK 
/<seg wad Bats sd ate ch a a 


\<end penne 


Fuzzy Match below 100% highlighted with a yellow background in the Editor, with the number of the 
segment and the 3 match rates. 


Fuzzy Matches with track changes in the Fuzzy Matches pane comparing the segment in the translation 
memories and the new document: strike-through in red for text not present in the new document and 
underlined blue coloured text for new text. 


Purzy Matched 


2-RETRIEVALS\RTD-2013-80050-00-00-EN-ORI-00_EN-PT-RET.trmx (+11 more) 27/01/12 16:06 
Match: <88/92/96%> - Source: <RTD-2011-800750100> - Transiator: <machame> 
(1) > ORI DIFF: The current membersMembers, observersObservers and their representing entities are listed 
in Annex tl. 

TM TRA: Os atuais membros, observadores e entidades que os representam estao enumerados no 
anexo 1. 


Editor - = AFO-2013- “B0050- 00-00- PT-TRA-GO,.DOCKe 


eee en 


The current Members, Observers and their representing entities are listed in Annex Il. 
<segment 0484Match 88/92/96 %> 


and Supneant 


Edited Fuzzy Match below 100% highlighted with a yellow background except the changes as they are 
made by the translator (as long as the segment is not validated). 


2-RETRIEVALS\RTD-2013-80050-00-00-EN-ORI-00__ EN-PT-RET. tmx (+5 more) 27/01/12 16: 06 
Match: <84/83/85%> - Source: <RTD-2011-800750100> - Translator: <machame> 
(1) -> ORI DIFF: the application shall describe how the applicant will contribute to CLARINthe ESS ERIC 
objectives and activities described in Article 2 and how it will fulfil its obligations referred to in Article 6.2 
Chapter 3. 
TM TRA: O pedido deve descrever o modo como o candidato contribuira para os objetivos e atividades 
da Infraestrutura CLARIN descritos no artigo 2.° e o modo como cumprira as obrigagoes referidas no artigo 6.°, = 
Editor ~ RTD-2013-800S0-00-00-PT-TRA-00,.D0CK 


the application shall describe how the applicant will contribute to the ESS ERIC objectives and activities 
described jn Article 2 and how it will fulfil its obligations referred to in Chapter 3. 
<segment 0494Match 84/83/85%> 


9 como o candidato contribu 


<end segment> 
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Machine Translation highlighted with a grey background and identified with “MT” after the segment number. 


Machu Transistion 


sretenttheataatenletesscrtrtaAbatuli ater 

The Parties shall take the measures necessary to stimulate the mutual exchange of information. 
<segment 17792 MT > 

As partes tomarSo as medidas necessarias para estimular o intercdmbio mutuo de informacdes. 


cane canment> 


Mautene Te anekation 
As partes tomardo as medidas necessarias para estimular o intercamblo mUtuo de informacées. 


Editar - ELARG-2014-80031-02-01-FT-TRA-OD, 00K 


The Parties shall take the measures necessary to stimulate the mutual exchange of information, 
<segment 17792 MT > 


As Partes adotam as medidas necessarias para incentivar o intarcdmbio mutuo de informagdes, 
<end segment> 


View — Preferences — Segments with a red background (when the segment is closed) for the revision 
stage when the option Mark Auto-populated Segments (default) is activated and Mark Revised 
Segments is activated too (2 segment below). When the segment is open, it turns to a greenish colour 
(3'¢ segment below). 


Editor - RTD-2013-80050-00-00-FT-TRA-00.00CX 


‘Article 1 


Translation last modified by costami on 20-Nov-2013 at 13:21:46 
The statutes of EATRIS ERIC are set out in the Annex.These Statutes shall be kept up to date and made 


publicly available on the website of EATRIS ERIC and at its statutory seat. 

<segment 0015 ““TRA™ > 

Os Estatutos do Consércio EATRIS-ERIC constam do anexo. Os Estatutos devem ser mantidos atualizados e colocados 
_& disposi¢do do publico no sitio Web do Consércio EATRIS-ERIC e na sua sede estatutaria. 

<end segment> 
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View — Preferences — Segments with a red background (when the segment is closed) for the revision 
stage when the option Mark Revised Segments is activated and the Mark Auto-populated Segments is 
deactivated. When the segment is open, the colour turns to green. 


| Edi@or - RTO-2013-80050-00-00-PT-TRA-D0.00CX 


Article 1 
Artigo 1.° 


Translation last modified by costami on 20-Nov-2013 at 13:21:46 

The statutes of EATRIS ERIC are set out in the Annex.These Statutes shall be kept up to date and made 
publicly available on the website of EATRIS ERIC and at its statutory seat. 

<segment 0015 ““TRA** > 

Os Estatutos do Consércio EATRIS-ERIC constam do anexo. Os Estatutos devem ser mantidos atualizados e colocados 
a disposi¢ao do publico no sitio Web do Consorcio EATRIS-ERIC e na sua sede estatutaria, 

<end segment> 


Editor — It can get quite colourful as several colour codes will mingle! 
In this example, segment colours for auto-populated segments (from Pre-Translate) plus other marks: 
e 1stand 2™ segments: auto-populated only; 
e 3% segment: auto-populated + a note; 


e Ath segment: auto-populated and open for editing. 


Introdu ction 
Introducao 


The Commission recently presented a framework for climate and energy policies in 
the period 2020 to: 2030<(0/>. 

A Comisséo apresentou recentemente um quadro sobre as politicas de clima’e de energia’ 
no periodo de 2020 a 2030-10'-. 


This framework proposes ambitious targets for greenhouse gas emissions 
reduction and renewable energy as part of the <t1/-Union<(2/>'s transition toa 
competitive low carbon economy. 

Este quadro propée objetivos ambiciosos em matéria de redu¢do das emissées de gases" 
com efeito de estufa 6 de energias renovaveis como parte integrante da transic¢ao da’ 

“t) Unido “2 /> para uma economia hipocarbsnica competitiva, 


Translation last modified by machame on 28-Jul-2014 at 14:57:33 
italso promotes: reduced energy dependency and more affordable energy for 
‘via’ internal market. 


<segment 0004 

Promove também a reduc4o da dependéncia energética e proporciona energia a precos” 
1ais acessiveis’ para‘as empresas 'e os consumidores decorrente do bom: funcionamento- 

do mercado interno. 

<end segmenr>> 


“Dictionary Mulupie Transiations T Comm mments | Glo: oeary 


‘oject autosaved on 14:00 
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— PART B— 
DGT-OT PROJECT STRUCTURE 
IN DETAIL 
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B.1. OmegaT_Projects folder me 


Before looking at the OmegaT project structure, let's first see the folder where your OT projects — and also your 
preferences files — are stored. 


The OmegaT_Projects folder is created by the DGT-OT Wizard when Omegat is installed for the first time in your 
computer in the C:\Users\{your login}\AppData\Local\DGT\ folder. 

It is here that the OT projects are created by the DGT-OT Wizard. 

When DGT-OT is first installed, the DGT-OT Wizard will also automatically create the following subfolders: 


Y _PROJECT-ARCHIVE (empty): Where you can store your finished projects just by dragging and dropping 
them from the main OmegaT_Projects folder so as not to have — over time — a long list of projects 
already finished. 


You can also simply delete finished projects as there is a copy of them in your H:drive (unless you delete 
them too!) 


yY _PROJECT- MEMORIES (empty): Where you can copy the memories of the documents finalized with OT 
and sent to Euramis. 


& Considering that Euramis strips all formatting from the segments it stores, it may be interesting to 
save here all the memories of projects that you have finished with OT for later reuse. 


For example, if new versions of already released — and heavily formatted — documents arrive, it 
would be a waste of time to reinsert all the tags again... which is what you would have to do if you 
retrieved that document memory from Euramis... and you wanted to use OmegarT of course. 


Y _CONFIG-PERSONAL: Where you will have all your preferences, memorized searches and dictionaries 
with learned and ignored words. This allows you to easily change them if you want. 


). logs 

script 

filters.xml 

omegat.prefs 

5) omegat.prefs-old 

bf pt_PT-ignored_words.txt 

bf pt_PT-learned_words.txt 
search.tsv 


©) uiLayoutxml 


Screenshot 11 — The _CONFIG-PERSONAL subfolder in the OmegaT_Projects folder. 


™ This folder is very important and you should be careful not to delete it as it contains files which 
store all your preferences — so that when you close and reopen the same or a different project — 
OT “remembers” your preferred settings and also some other information. 


However, if you delete it by accident, the default preferences will be restored when you reopen the 
DGT-OT Wizard... but you will lose your preferences. 
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B.2. DGT-OT project structure in detail me 


To easily manage (complex) projects, it may be worthwhile to take the trouble to understand the structure of a 
DGT-Omegat project. So, let’s look at it in detail. 


When you create a project with the DGT-OmegaT Project Wizard, this is the project structure that is automatically 
created. 


THE DGT-OMEGAT PROJECT STRUCTURE 
when a new project Is created via the DGT-OT Wizard 


Monolingual or bilingual 


dictionary{es} . if any 


Glossary(ies) if any - If none. OT will 
, ss create automatically a writable 
—— : glossary for the project 
Working project memory - 
one for the whole project 


Properties file created with the 


aleots —<- 
project. Dont touch it = S Rehiwetnwrnertcnbll ahee shone 


1-80034xise 


Screenshot 12 — Project created automatically by the DGT-OT Wizard. 


The project is created in the C:\Users\{your login\AppData\LocallIDGT\IOmegaT_Projects folder. 

You can access it via the DGT-OT Wizard (click on Browse), from within OmegaT with Ctrl+Shift+F1 (or via the menu 
Tool — Open Project Folder) or in Windows Explorer directly. 

™ These folders and files should not be deleted as otherwise OT may not work properly (or at alll). 


But you can organise your external translation memories and create other folders freely as, when you open the project, 
OT will accept it without any “recollection” of how it was before. 
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During the translating of the project, other folders and files may be automatically created, namely the \euramis and 
\export-omegat subfolders and the files {name of project}-level1.tmx, {name of project}-level.tmx, {name of 
project}-omegat.tmx as shown in the screenshot below. 


You can also create project subfolders, for instance, to use monolingual reference documents for search purposes or to 
copy translated documents without revision/with revision to a new folder you create as the translated documents in the 
\target folder are automatically deleted every time you do Create Translated Documents. 


So a real-life project will/may have this kind of additional folders and files. 


THE DGT-OMEGAT PROJECT STRUCTURE 
evolving in a real-life situation 


omega! 
}. References-Monolingual 


+. source 
}. TagWipe 
}. target 
i tm 


omegat. project 
©) OTStats_T-ELARG-2014-80031-80034 xisx 
© T-ELARG-2014-80031-80034-levell.tmx 
Global project memories created ® T-ELARG-2014-80031-80034-level2.tmx 
with Ctri+(Shift+) D - without notes 
© T-ELARG-2014-80031-80034-omegat.tmx 


Screenshot 13 — An example of the structure of a DGT-OmegaT project during the translation process 


Now let’s see what is the purpose and content of each folder/file. 


B.2.1. omegat.project file 


This is an essential file to open the project. If this file gets corrupted somehow, you will not be able to open the project 
and OT will display an error message. 


ho Therefore, don’t touch it! 


If this file gets corrupted (Something which happens very, very rarely in my experience) don’t panic! 


ae 


@ See Part P on Troubleshooting. 
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B.2.2. source folder 


The folder where your original formatted document(s) to be translated are kept. In DGT implementation of OmegaT, 
there is no conversion of the original documents into the xliff format. 


In the case of docx originals, these documents are cleaned of tags with the DGT TagWipe application. 
If you have documents with a subfolder structure, OmegaT will keep it. This is especially useful for web page projects. 


B.2.3. target folder 


The folder where your formatted translations in their native applications are generated when you do Create Translated 
Documents or Create Current Translated Document. 


If you have documents with a subfolder structure, OmegaT will keep it. 


B.2.4. TagWipe folder B&@ 


When the project is created with the DGT-OT Wizard, the TagWipe application is automatically made project-specific so 
that if, while translating/updating projects, TagWipe is changed/improved by the IT Unit, your project will not be affected 
and you will not have unduly untranslated segments. 


B.2.5. omegat folder B&@ 


This is a very important folder as it is here that OT saves your translated segments and also some other information. 


Contrary to the public Omega’, this folder does not contain the learned_words and ignored_words files as in DGT-OT 
these files are unique — for each language pair — and saved in the CONFIG-PERSONAL folder to be used for all 
projects. 


Here is a screenshot of what the \omegat folder may look like. 


taf files_order.txt 

4| last_entry.properties 

4 project_save.tmx 
project_save.tmx.201503271949.bak 
project_save.tmx.bak 

af project_stats.txt 

taf project_stats_match.txt 

taf project_stats_match_per_file.txt 

#) segmentation.conf 


Screenshot 14 — Files in the \omegat subfolder 


| 
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During the translation of the project, the \omegat subfolder may/will contain: 


a) project_save.tmx: The project memory file where the segments you translate in OT are stored. 


This is what an OT project memory looks like: 


| 9) preyectsave.tmex » Notepad 


<?xme) versione"1.0" encoding="UTF-8"7> 
<!DOCTYPE tmx SYSTEM “tml. dtd"> 
<tex version="1.1"> 
«header creationtool="OmegaT" o-tmfex"QmegaT TMX” adminlange"EN-US" datatypee"plaintext” creationtoolversione"3.1,.2" 


segtype="sentence” érclang="en-gb" / 
i <body> 
}i<t-- Default translations --> 
«tu» 

| <tTuv lange"en-gb"> 

<seg>(1) Membership contribution: </seg> 

/tuv> 
} <tuv lange "pt-pt" changeide"machame” changedate="2014120372020542" creat jontd="machame” 
| creationdates"20141203711084 32"> 

<seg>i) Contribui¢gdo dos membros :</seg> 


/tu 
<tu> 
<tuyv 


</tuv> 


lange "en-gb"> 


<seg>(2)41t; tO/agt; £1t;t1/Sgt; ost Premium: </seg> 
</tuv> 
<tuv lang="pteot" changeid="machame” changedate="2014120371101002" creat ionid="machame” 
creationdate."20141203T1101002"> 
<SOQ>C/41t  tO/Agt: SI tl /Agt:Préaio de acolhimenta:</seg> 
</tluve 
</tu> 
<tu> 
<tuv lange"“en-gb"> 
<sog>(3)41t; tO/Egt; £1t; t1/Sgt; AGREED CONTRIBUTIONS FOR THE PERIOD 2015-2019</sog> 
</tuv> 


_stuy lang="ptept" changeid="machame” changedate="20141203T1111392" creationid="machame™ 
creationdate="20141203T1111302"> 
<SOQg> C3) 41E; CO/40t; GIT TL /SGT CONTRIBUICOES ACORDADAS PARA 0 PERTOOO DE 7015-7019</sSeg> 
</tuv> 
</tu> 
<tu> 


Screenshot 15 — Example of an OT project memory 


This project memory includes segments that may be “orphans”, i.e., that you translated in that project 
but which no longer exist in the original(s) if you replaced one or more documents by newer versions. 


The project memory prevails over all the external memories, i.e., the segments contained in this 
memory are the ones which are displayed in the Editor pane and used to create the translated 
documents. 


There is only one exception however: “@ \tmlenforce subfolder below. 


b) project_save.tmx{time-stamp}.bak: Backups that OmegaT does periodically of the 
project_save.tmx file in the same folder (therefore locally in your computer) and every time you save 
or close your project. 


They are time-stamped and can be used if the project_save memory gets corrupted for some reason. 


This is an OmegaT feature. It has nothing to do with the backup the DGT-OT Wizard does to your 
space in the H: server to ensure that — if your computer has a crash — your work will not be lost as 
the project memory is saved outside your computer. 


Cc) project_stats.txt: Every time you select Statistics in the Tools menu OT saves the statistics results 
within the project — between source documents — in this folder. If you run Statistics again, this file 
will be replaced with the new data. 


ey That is why in DGT we have created the OT_Stats Excel sheet to record how a project evolves! 


d) project_stats_match.txt: Every time you select it in the Tools menu, OmegaT saves in this folder the 
results of the match statistics with the memories in the \tm subfolder and the segments translated in 
your project memory, if any. If you run it again, it will replace the previous file. 


e) ® project_stats_match_per_file.txt: Every time you select it in the Tools menu, OmegaT saves in 
this folder the results of the match statistics per file with the memories in the \tm subfolder and the 
segments translated in your project memory, if any. If you run it again, it will replace the previous file. 
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f) Later) segmentation.conf: When the project is created with the DGT-OT Wizard, the segmentation 
rules of your project are automatically made project-specific so that if, while translating/updating large 
projects, the segmentation rules are improved by the IT Unit, your project will not be affected (and you 
won't have unduly untranslated segments). 


® files_order.txt: File where OT stores the order of the source documents in the project if you 
reordered them using the Move buttons in the Project Files list. You can change that order at any 
time. 


Ko} 
a 


® files_entry.properties: When you reopen a project, OT “remembers” the last segment you edited 
in the last session before closing that project. This means that OT, instead of opening the first segment 
of the first document in the project as before, it will open a segment somewhere in the middle of your 
project ... if you had edited something before, of course. 


=F 
= 


B.2.6. mt folder a 


This is a DGT-specific folder as DGT doesn’t use the MT systems available in the public Omegat. 


This folder is where the Machine Translation files are automatically copied to when you create a project with the DGT-OT 
Wizard. 


In OT, machine translation is displayed separately in the Machine Translation pane and is never mixed up with human 
translation. 


B.2.7. tm folder 


Where the external memories — Euramis retrievals and aligned reference documents (in tmx format) — are kept. 


By default, the DGT-OT Wizard will copy to the \tm subfolder all the pre-processed files in Tradesk (in the dossier \pret 
folder), with the exception of machine translation files, of course. 


You can create subfolders if you want to organise your translation memories and you can also add or delete memories in 
this folder. 


It is also in this subfolder that TeamBase saves the link to a shared memory on a server (example below: 
T-MJM-EN-PT. properties) when you are working in shared mode with one or more fellow translators. 


Name aa 
Reference documents memories 


organised by subfolders/subjects a- MARY -REFERENCES 


2-Norm-Memory 


E 3-Financial-Regulation 
Reference documents memories 


(extension download — DWN) 


auto 

NoOG-2011-32011R0282R_01_EN-PT-DWN,tmx 

RTD-2013-80050-00-00-EN-ORI-00_CelexEN-PT-AL.tmx 

Retrievals memories (extension RET) RTD-2013-80050-00-00-EN-ORI-00_EN-PT-RET.tmx 

* RTD-2013-80050-01-00-EN-ORI-00_CelexEN-PT-AL.tmx 

Memory with extraction of titles from ® RTO-2013-80050-01-00-EN-ORI-00_EN-PT-RET.tmx 
Eur-Lex (extension Celex) ® RTD-2013-80050-02-00-EN-ORI-00_CelexEN-PT-AL.tmx 

© RTD-2013-80050-02-00-EN-ORI-00_EN-PT-RET.trnx 

Link to TeamBase when translating in * RTD-2013-80050-03-00-EN-ORI-00_CelexEN-PT-AL.tmx 

* T-NUM-EN-PT.properties 


share mode 


Screenshot 16 — The \tm subfolder when a project is created with the DG-OT Wizard (the subfolders — with the 
exception of the \auto subfolder — were created afterwards by me) 
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This is what an external (Euramis) memory looks like: 


£002 POONA T tee « Noskenadt 


| 

: 
t 
i 


Screenshot 17 — Example of an external (Euramis) memory 


B.2.7.1. tm\auto subfolder 


When a project is created (directly in OT or via the DGT-OT Wizard), the subfolder tmlauto is also automatically created. 


This is the subfolder (created empty) you can use for Pre-Translation (auto-population) purposes, either before 
starting translating or during the translation process. 


The tmx files you copy to this subfolder are used to pre-translate the documents in the \source folder when you open the 
project or do Reload. 


B.2.7.2. tm\tmx2source subfolder @ 


This subfolder is not automatically created when the OmegaT project is created (either via the DGT-OT Wizard or 
directly in OT). You have to create it manually in Windows Explorer if you want to use this feature. 


The target segments in the files in this subfolder will be displayed in the Editor as a second source segment. 


B.2.7.3. tm\enforce subfolder @ 


This subfolder is not automatically created when the OmegaT project is created (either via the DGT-OT Wizard or 
directly in OT). You have to create it manually in Windows Explorer if you want to use this feature. 


The segments in the files in this subfolder are the only ones that have priority over the project memory and — unlike the 
files in the \tmlauto subfolder — will even replace any segments already translated (saved in the project_save.tmx) 
and will continue to replace any segments you change afterwards. 


™ So, if you use this feature, don’t forget to delete this memory/folder, otherwise it will keep overriding the segments 
you change afterwards and which have a 100% match in this memory. 


B.2.8. glossary folder 


Where the glossary(ies) are stored. You can place here glossaries you already have and want to use in a project. 


When you create a project with the DGT-OT Wizard, it will automatically (by default) copy to this folder — if you have not 
unticked that option in the DGT-OT Wizard — an IATE extraction (source and target) and those terms are displayed in 
the Glossary pane and used by the TransTips and Auto-completion features. 


& You can also place the glossaries in any other folder outside the project folder if you change the respective path in 


Project — Properties menu. 
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B.2.9. dictionary folder 


Where dictionaries are placed if you want to use any. When a project is created using the DGT-OT Wizard, for the 
moment there is no dictionary copied to the project. 


& You can also place dictionaries in any other folder outside the project folder if you change the respective path in 
Project — Properties menu. 


B.2.10. euramis folder ®&@ 


This is a folder that is only created if you run the script Create Euramis Export in order to generate individual translation 
memories of your documents with the correct attributes to be sent to Euramis. 


This memory has all the segments of a particular document without orphan segments and notes and without alternative 
translations. 


B.2.11. export-omegat folder BE@ 


This is a folder that is only created if you run the script Create OmegaT Export in order to generate individual translation 
memories of your documents with alternative translations and notes (and without orphan segments) to be used in the 
revision process. 


& After you finalize your translation — and if you use OT until the end of the process (revision included) — you can 
generate with this script individual memories of your finalized translated documents and store them in the 
_PROJECT-MEMORIES subfolder if they have alternative translations and notes. 


B.2.12. Project memory files 


When you create the target documents with the Create Translated Documents/Create Current Translated Document 


command, OT generates 3 further memory files in the main project folder (“2 Screenshot 13 above and Section B.3. 
below): 


Y {project-name}-level1.tmx 
y {project-name}-/evel2.tmx 
y {project-name}-omegat.tmx 


These three memory files contain all the translated segments in the documents in the \source folder when the translated 
documents are generated. 


Unlike the project memory (project_save_tmx), these memories do not contain “orphan” segments, i.e. those segments 
that no longer exist in the documents in the \source folder when you generate these memories... and which may not be 
fully revised! 


These memories also do not contain the notes that may be in the project_save.txm memory. These memories are not 
accepted by Euramis as the DGT attributes are not encoded in the format required by Euramis. 


B.2.13. OT_Stats — Personal statistics Excel sheet BE@ 


When you create a project with the OmegaT Project Wizard, it will also copy to the main project folder an Excel file — 
named OT_Stats{name of the project} — in which you can record: 


a) The project statistics in an easily human-readable format; and 
b) The progress in your work if you want to have an idea of how many words/characters/pages you translated 


per hour/day/week or any other period. 
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B.3. Project memories sxe 


Because it may be a bit confusing to have so many memories of a project — i.e. of your translation work — here are the 
main features of each so that you can choose what to use or to keep. 


& Considering that Euramis cleans all the formatting in the segments it stores, | recommend that you keep, at least, 
a copy of the memory you sent to Euramis and/or the memory generated by the Create OmegaT Export and 
save them in the _PROJECT-MEMORIES folder. 


These tmx memories have all the formatting and, if you have heavily tagged documents which some time later 
may have a new version, you can use them for Pre-Translation with formatting included. 


Memory Folder inthe | Global/ | Format | Alternative Orphan | Euramis 
project by doc | (tags) translations segm format 
Yes Yes Yes Yes No 


May have 


project_save \omegat Global eeuecl 
was Generated by \export_omegat Bydoc Yes Yes Yes No No ES LENS 
Create OmegaT Export = several 


The one 
selected 


May have 


\euramis By doc Yes No No No Yes 


Main project 
folder Global No Yes No No No eaueral 


Main project May have 
folder Global Yes Yes No No No eaveral 


Mein plaice Global Yes Yes No No No May have 
folder several 


The project_save.tmx file contains all the segments that have been saved in the memory since 
you started the project (therefore including orphan segments). This file always exists in the 
project. Its contents will always be sorted alphabetically by the source segment. 


[1] Generated by A DGT adaptation of the publicly available write_sel_files2TMX script allows exporting 
Create OmegaT memories by document. Used in DGT for the revision process. Generated by pressing 
Export Ctrl+Shift+F9 and selecting the document(s) you wish to have memories exported from. 


“| Generated by A DGT adaptation of the publicly available write_sel_files2TMX script allows exporting 


Create Euramis 
Export 


{project-name}- 
level1 


{project-name}- 
level2 


memories by document with DGT attributes encoded in Euramis format. Used in DGT to send 
memories to Euramis. Generated by pressing Cirl+Shift+F8, choosing the translator’s login and 
selecting the document(s) you wish to have memories exported from. 


One memory for the whole project which contains only textual information (i.e., without tags). 
Can be used in other CAT tools. Generated when pressing Cérl(+Shift)+D. 


One memory for the whole project which encapsulates OmegaT specific tags in correct tmx 
tags so that the file can be used with its formatting information in a translation tool that 
supports tmx level 2 memories, or OmegaT itself. Generated when pressing Ctrl(+Shift)+D. 


One memory for the whole project which includes OmegaT specific formatting tags so that the 
file can be used in other OmegaT projects. Generated when pressing Ctrl(+Shift)+D. 
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— PART C — 
PREPARING AND FINALIZING 
DGT-OT PROJECTS 
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For simple projects, it is enough to create the project with the DGT-OT Wizard and have the standard translation 
memories by default available in Tradesk, as well as the IATE extraction. 


However, for large/complex projects, it may really be worthwhile to take the time to organise the project before starting... 
Or organising it or updating it with new information later on. This workflow is DGT-specific. 


C.1. Tradesk a= 


C.1.1. Local copy of the originals 


The starting point to create (or update) any project with one or several documents (with the same or different dossier 
numbers) is always to use the “Local copies” option in Tradesk to copy to your computer the original documents to be 
included in the project (by default they are copied to the Local Documents — no backup — DGT — Dossiers folder). 


You can use the Tradesk feature Local Copies to copy the originals to your computer one by one. Just select the 
document in the list of your current jobs and click on that option. 


Pan 


o 7, arean 2ayeraote 2h 
AED 2904 MOCKS O81 * Eh PT Bi tenet as nie Di naan TRA Talo: Prepon for « Count Darts on he rigring and preceane! eppiction 
RTD 2004 noo oF PT Ri tegeuce i“ rarer ny nv2ata Te OP betel of the than of the Aepowmees fer teretly ent tratenaingion 
e7D eM eee “ ree 7) neva neon ec a alae 
RTP 2364 BOObE OF 00 D ™ FT Biteences ri ier 22 4 Te =e 
TD 2906 woces On-t1 » OH FT Mi teqees a2 nian 22 w/a Te . 
RTD Jae mucet eset a FT epee “ ae rage aw S. 4 
TD 2014 SOOeEE2 Ot Ld ee bo a Rien 22) 102004 2 
ATP 204 W065 9.90 * Ce as nen wyrane nue Target documere and eubparts 
WED yore Woees COC a mF Ls 24 3/10/00 nye ew 
WTD Rohe MNES et » ee “ 2aeimepete Ry epee ew - 
RTD pate BOOMS CF n ™ FT Si epewnt a3 DaeigvRane De Rote 7 pi perey terster 
ATD 2063 Corte tr at Th ™ ft Bteqeus Be) mri eu POA ee z 
OD eth CUtse- 81-29 WA OFT Bi tepeces i mae er ow free OE | Promiman || Preen-omber dec : 
AID Orbea en ey ™~ Ft) Bl eepewer ed mene ere row (ecw mores | Upseed pect 2 
KA an FF Loe 1a mre 
a) Of Bi tepevet 1 


AID290F OFED-E) 20 " J a 4 ! wiv nw creme 
FD 200) Ohman a7 serie me pare re 
Comets oe: 


Screenshot 18 — Tradesk list of current jobs 


But if you have several documents in the same dossier, the fastest way is to select the icon for Dossier (in the 
Screenshot above) and do Local copies of all the documents you need in a single operation. 


a RTD- 2014-80063 


Lites ties 
Wh Requested - | Ongeng - HE rues - 
ata , — 
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oo 9 i EM me MM . Me MM he MM MM ee ee 
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Screenshot 19 — Tradesk list of documents in a dossier 
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C.1.2. Note for each dossier 


In Tradesk you also have the Note feature which is the means of (interinstitutional) communication and where 
sometimes you have instructions from the Planning or from the author. 


You can add notes about matters/problems related to: 
y_ The original and in that case you write the note in the General section as it applies to all languages 
y Your language and in that case you select the section for your language. 


Heeene | Fhatip Cagyertaees (Semtaet | Prefihe | Pyeterencms | Candee 


‘TRANSLATOR’S DESKTOP ethene | OUSE/DOTRFT/N | RROCUC HON 
| Sect Yom | 


BY sassy: | Coe eer ——a 
CC Oe oo 


Ai PP-2015-00248 Cerner comans © 

a 

| “decane Geimalal Seat 7 eareet ear eh eee eek et Som, See Somes ieee) SHI lo Re sete GteN SUIER Siliae cubed Mamet heed Cees dalbay 9) 
| Sin | 

| NOTE | “abd empert Prime nace Lpcere land wears anor int to TICE mentee 


A ~ 
‘Tee of the current coster, Tactoneet Ouptal Single Market 


Screenshot 20 — Tradesk note for each dossier 


C.1.3. Euramis Match Statistics for each document 


In Tradesk you also have the Match Statistics by document. 


TRANSLATOR’S DESKTOP 
=a 1 


Retrieval fesuits (fr eqQuetted match fate: TOK, 


Sages match rate 
< 20 cheraecters 50-9% characters > ++ characters Total segments charecters segment. 
Source document 10 ” 100.0% 100,08 


Screenshot 21 — Euramis Match Statistics per document available in Tradesk 
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C.1.4. Pre-processed reference documents 


Frequently, there are some reference documents for each document/dossier which are pre-processed either 
automatically (because they are referred in the original documents) or which are detected by the Pre-Processing Team. 
They are listed in Tradesk under Euramis — Available legislation. 


If you wish, download them before creating a new project. But you can also — at any time — request aligned reference 
documents in Tradesk and Update your project via the DGT-OT Wizard and those new memories will be added to your 
project. 


Retrieval Report Fes ready for import, Avadable legidation — Algnments to be corrected Reference documents to be aligned Word retrievals Other fle types 
Raterence: 3199582568 Recunt Ale 

Tile: —-Regulamerta (CE, Eucatom) n' 29BL/9E do Corselho, de 18 de Dezemtre de 1995, relative & protecedo des interemes fanceires dx: Commnidades Europe'as 

File: . 


Reference: 3195682185 Ragunt fle 


Title:  Regulamento (Euratom, CE) n° 2186/96 do Conselho de 11 de Novembro de 1994 relativa as inepecedes © verificagdes no local efectuacas pela Comissio para proteger os interesses 
financeiros das Comunidades Europelas contra 2 fraude e cutras ‘cregularidades 


File 


Reference: 3200281605 | Recuest Ale 
Title: Reguiamento (CE, Euratom) Nt.’ 1605/2002 do Consatho, de 25 de Junho de 2002, que intitut o Ragulamento Fisanceito apbcivel ao orcamanto geval das Comunidades Europelas 
File: 


Raference: 3200282342 Rogues fle 
Title: ——-Reguismranto (CE, Euratom) n.* 2342/2002 da Cominsdo, de 23 de Dezemibes de 2002, que estabelece as normas de exacurio do Regulamento (CE. Euratom) n,* 1605/200 do Comets, 
Que instiul o Regulamento Financeire aplicavel ao orcamento geval Cas Comunidades Curcpeias 
File: . 
Retereoce: 320070478 em Be 
Title: -Regulamento (CE, Euratom) n. @ 478/207 da Comitsdo, de 23 de Ard de 2007 , que altera o Regularento (CE, Euratom) n. © 2342/2002 que extabelece as normas de execucio do 
Regutamento (CE, Euratam) n.o 1605/2002 do Consetho que teatitel o Regulsments Financelro aplicavel 20 ceramento geral das Comunidades Europelas 
File; 
Raterance: 3200085081 Request Ale 
Titles —-Regulamento (UE, Euratom). ' 1081/2010 do Parlamente Europeu e do Comelno, de 24 de Novembro de 2010 , que atere o Reguiamento (CE, Euransm) n. * 1605/2002 do Consethe 
que institul o Reguiamenss Firanceirs aplicivel ao cecamsente geval cas Comanidaces Curcpelas, rio que diz respeite ao Servico Suropau para a Accio Externa 
Pile: 
Reference SI01DKO464 Fecuast fle 
Tithe: -Reegutamerito (UE, Euratom) m. * 96/2012 do Parlamento Europeu @ do Consetba, ce 25 de outuore de 2012 , relative as Ceporicdes financeiras aplicavets ac orcamento garal da Unido @ 
oue revoea o Rezulaments ((E. Furatem| n, * 1805/2002 


Screenshot 22 — Example of a list of reference documents available in Tradesk for a particular document 
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C.1.5. Copying (uploading) translated documents to Tradesk 


During the translation of a project or when you have finished translating one or all the documents in a project, you can 
use Tradesk Upload Product feature to copy translated documents to Tradesk. For the moment this operation is not 
automated. 


To copy translated document from your local project to Tradesk: 


1— 
2— 
3— 


4— 


In Tradesk, click on Upload Product (See Screenshot 18) 
Click on Browse 


Select the OmegaT_Projects folder (under C:\Users\{your login}\AppData\Local\DGT)), the project you want 


and the \target folder or any other folder where you have the documents you want to copy to Tradesk 
Click on Upload 


If there are several documents to copy to Tradesk, repeat this operation for each of them. 


Home |Help | Suggestions |Contact |Profile | Preferences [Disconnect | 


TRANSLATOR’S DESKTOP machame | (IE/CE/DGT/B/PT/1) | @ PRODUCTION. 


Upload document 
C:\Users\machame\AppDatalLocaliDGT\Omega I _Projects\SG-2015-801 13.Digital-Single-Market-Strateg | Browse..._| 


Target document/subpart information W”yY 
Requester bd 

Number 95) )) 

Part 0 


Language 9 


Screenshot 23 — Copying (uploading) documents translated in OT to Tradesk 
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C.2. Euramis = 
C.2.1 Euramis databases 


Although we think of Euramis as one (giant) Translation Memory database, it is in fact divided into several databases as 
shown below. 


Interinstitutional Translation Memories 


Final documents involved in legislative procedures. Authors: Commision (final COM documents), Council, Parliament, EESC/CoR. 


ee DG Translation Memories 


Operational DGs (only last 5 years) 
Services Internal and general services (only last 5 years) 
Web translations 


| Old DGs snd services (names used until 1999), current DGs’ old documents 


Commission - Topical Transistion Memories 


TM Description 
Budget 

Legislation and Jurisprudence 

Combined Nomenclature, etc. 

COM, SEC, other institutions, international organisations 

Standard documents 

None of these 

EU legislation published in 0) L series since July 2004 


Council - Translation Memories 
ee 
Council-Master Council documents & related background information. More details. 


European Parliament - Translation Memories 


Created in 2006, this memory contains EP committees, delegations and Plenary files, as well as files belonging to parliamentary assemblies such as the ACP-EU Joint Parliamentary 
Assembly or EuroLat. More details. 


EP-Basic-References Final versions of Basic EP reference documents: Conventions, Agreements, documents related to EP's Human Resources Policy and activities. More details. 


EESC-COR - Translation Memories 
eC 
scare «AES nt eet 


European Court of Auditors - Translation Memories 


Final ECA reports (2.9. Annual, Special and Special Annual Reports) and reference documents on audit methodology (e.9. Audit Manuals) and administrative procedures (e.g. Rules of 
Procedure). Compilation of databese under way. 


Court of Justice - Translation Memories 


This memory contains aligned texts of the Case Law of the Court of Justice : opinions (“Avis" and "Conclusions”), judgments, orders, summaries, information on unpublished 
decisions (INF, REF) views of general advocates ("Prise de position"). More details. 


European Central Bank - Translation Memories 


Published ECB documents (e.g. Annual Reports, Monthly Bulletins, Convergence Reports). ECB legal acts (e.9. Guidelines or Decisions) can be found in the OJ-L database. 
Compilation of database under way. 


Screenshot 24 — Euramis databases 
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C.2.1 Automatic Euramis retrievals 


The fact that Euramis is composed of several databases has implications in terms of retrievals, i.e., retrievals are 
pre-processed only with segments from the Euramis databases which are considered relevant depending on the 
requester service/DG/Cabinet, as shown below. Take into consideration that this may change over time! 


You can consult this list here. 


9 
1?) 
iv] 
9) 


Memory Memory 


Policies, _Legis-Juris,OJ-L Services,Llegis-Juris,O3-L 
Policies,Legis-Juris,OJ-L 
Policies,Legis-Juris,OJ-L 


ExternRelations,Legis-Juris,OJ-L 


Policies,Legis-Juris,OJ-L 
Budget,Legis-Juris,OJ-L 


Budget,Legis-Juris,OJ-L 


Budget,Services,Legis- 
Juris,Policies,ExternRelations,O3-L 


is,OJ-L 


rf 
u 
c 
3 
: 


Policies,Legis- 
Juris,ExternRelations,Services,OJ-L 


Policies, Legis-Juris,Services,OJ-L 
ExternRelations,Legis-—Juris,OJ-L 
Policies,Legis- 
Juris,ExternRelations,Services,Budget,OJ 
— 

Policies, Legis-Juris,Services,OJ-L 


Policies,Legis—Juris,OJ-L Services,Llegis-Juris,-OJ-L 


Policies tegis~2uris,servicesoo-t estar [Sons nomunclatures,O2-L 
Policies, Legis-Juris,OJ-L 
ExternRelations,Legis—Juris,OJ-L 

HOTL Web, Policies, Legis-Juris,OJ-L 
Policies,Legis-Juris,OJ-L Services,Legis-Juris,O3-L 
icai4——sissCéd@Poollicies, Legis-Juris,OJ-L TAS ——sd Services,Legis-Juris,OJ-L 
icais|——___—s[ Policies, Legis-Juris,O3-L 3RC——sS| Policies, Legis-Juris,OJ-L 
[cCAIGs|Pollicies.Legis-Juris,OV-L USTs Policies Legis—Juris,O3-L 
jcaazs*[Pollicies.Legis-Juris,OJ-L CO IMARE | Policies, Legis—Juris/OJ-L 
lcaissisd Policies, _Legis-Juris,Nomenclatures,OJ-L MOVE i Policies,Legis-Juris,OJ-L 

[cA19 Cs ExtlernRelations,Legis—Juris,OJ-L INEAR | ExternRelations,Legis-Juris,OJ-L | 
[cA20.———C*dPollicies,Legis-Juris,OJ-L_——— COB sSServicessLegis-Juris.OV-L | 
[cCA2as*|Policies,Legis-Juris,OV-L OTR sdServices,Legis-Juris.O7-L 
Policies,Legis-Juris,OJ3-L 


Policies,Legis- 
Juris,ExternRelations,Services,OJ 
-L 


Policies,Legis- 
Policies,Legis-Juris,OJ-L Juris,Services,ExternRelations,OJ 
me 


[cazs RTD 


ExternRelations,Legis-Juris,OJ-L 


Policies,Legis-Juris,OJ-L 
Policies,Legis-Juris,OJ-L 


Policies,Legis-Juris,OJ-L 


Services,Legis-Juris,O3-L 


ExternRelations,Legis-Juris,OJ-L 
Policies,Legis-Juris,OJ-L 
Policies,Legis-Juris,OJ-L 


Policies,Legis-Juris,Services,OJ-L 


Policies, Legis-Juris,OJ-L Policies,Legis-Juris,OJ-L 
Policies, Legis-Juris,OJ-L SANTE Policies, _Legis-Juris,OJ-L 
Policies,Legis-—Juris,OJ-L Services,Llegis-Juris,-O3-L 
Policies, Legis- oe ees 


Juris,Services,ExternRelations,OJ-L ry a mRelations.O3 


Policies,Legis-Juris,CoJ-Law-CaseLaw,OJ Services,Legis-Juris,CoJ-Law- 
= CaseLaw,OJ-L 


- ts = - +4 Policies,Legis- 
CouncilPosition |Budget.Legis-Juris,OJ-L TAXUD ema: Wcocsaeanee Badounea 433-8 


Policies,Legis- 


IpevVco ExternRelations,Legis-Juris,OJ-L TRADE Seni iciiowaaltcl-atireca= 4339. 
IDGT  ———Csf Policies, Legis-Juris,Services,OJ-L WEB ——_—s | Web, Policies,Legis-Juris,OJ-L 


Screenshot 25 — Translation memories consulted in an Automatic Retrieval for documents from a given DG, 
service or Cabinet 


ae 
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Retrievals from the Euramis databases related to the documents to be translated are automatically extracted, as well as 
titles from legislation and whole reference documents (if match percentage is important). These files are automatically 
copied to your project when it is created via the DGT-OT Wizard. 


In the example below, it is indicated in the Euramis report the matches from those Euramis databases: Legis-Process, 
Legis-Juris, Normative Memory and High-Frequency segments databases. 


Retrieval results (requested match rate: 70%): 


Segments Match rate 
= 30 characters 30-99 characters > 99 characters Total segments Characters Segments 
Source document 12 4 23 39 100.0% 100.0% 
7O-84% 1 Oo 4 = 12.1% 12.8% 
100% a1 4 4 19 21.0% 48.7% 


7O-84% oO Oo = = 6.0% 7-7% 
100% o Oo Oo o 0.0% 0.0% 


100% = a Oo 6 3-2% 15.4% 
100% 6 1 Oo - 2.4% 18.0% 


Screenshot 26 — Euramis databases used in pre-processing for a particular document 


C.2.2. Manual Euramis retrievals 


If you want to have retrievals from other Euramis databases, you can request them via Euramis — Translation Memory 
— Retrieval where you can select up to 5 databases, the match rate and the number of matches that are retrieved. 


You may also have to do it manually, for example, in the case of multilingual documents for which there is no 
pre-processing. 


Retrieval 


Craeratet mee Breen y 
feos > Lramalatiog marmary * Setrievs! 


TS (oe 
| l} t. Om] 
Bas tors} A 
9 Eaternftctabure 
Files setected: (1/4) > Policies 
© CMPET-2044-00040-00-00-11N-ORC-00 DOCH © Services 
| Acree 
__ Be not keep copy on seever | Other tire, 
—. 20 pot Seep copy on esever — = 
pF aset tavepnnpet sal Leares| 
je aa ote s * 2) ro a a7 * noe & om new uv Lop hets ™ 
-cs a er o n oT eT a Te “ a er ons 7 ena ‘n ab 
le OA a EN ‘ al ur C 6 ua tn t oe ut inn 
Coen Master 
» FP Conrwritios< 
| Match cate: 70% 7 Extract Cakes rote EP Donk References 
| >: — EESC-DOR 
| Matches pry memory: 3 is) " Extract COM deae iCA 
jek coe Direct only © Rewerue « Inefinert ond reverue . Ienheds mont rahe con er Comet 
| Oetpet format: © Thr wm, Lege Process 
O34 
Comoscanet- acts 
wn oe - Oe — _ ~~ mochere —y 
| Stee j Save eottings > Rewet 


Screenshot 27 — Requesting Euramis retrievals manually 
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C.2.3. Requesting aligned reference documents from Euramis 


You can also request in Euramis aligned documents to be used in your project as reference memories. You can choose 
to have them sent: 


y To the \pret folder of the relevant Tradesk dossier and, in that case, those memories will be automatically 
copied to your project when you create or update it using the DGT-OT Wizard, or 


y By email and in that case you have to copy those tmx files manually to the \tm folder of your project. 


Document Search 


Dearie done MAL TIAAM) @ | mmmeelenive (Peemy) 


Hranatation Mereory dearth 
Homer > Search > Derument Search 


3D0T ROOD 1 -o? 


Disclaieer; Results see Baset on the Wstest Duras levertory files, wich are upéatad ewory Hoh 


Reg. Sorts) Year(s) Dex. Typels) Revels 
meee - omar - ey - 0 
Camenteston \ 2036 Commtasion - 1090 

AA aor Accession Treaty AS THON ww 

AGRE 2014 Accession Trealy BG RO [30 | 

As a Acceasion Treaty £ # 

aD mip Acoension Trewty El 

Quoc aon Accession Treaty UK DK TRL N 

CAD 2010 Act EFTA Standing Comunittes 

cal ¥ 200%" Act CFT Surv, AuthorRy 

I __________ Sg _________- = 


Screenshot 28 — Euramis Document Search to request aligned memories of reference documents 


If you have several reference documents to request, the most practical thing to do is to select the option Copy file to 
pret and afterwards Update your project in the DGT-OT Wizard. 


tiara Dive MAQEADO @ Comerdnsinn (PROG) 


Searth pas anertris 


(A 
Req. 


¥ Doc. 
‘o. tyre. et 


N 


1 document found in direct mode 


: Doc. No. 
Oot Iua20001 


Gret translations onty 
odd (everse translations if found 
* odd iedivect end reverse translations if found 


Nee: you Cannot wae thas file for the purpose of correcting errors in @ Lurnaris trensistion mamory. 
Do not <avee it be central memory! = 
| |My pretties at wheel» iP 


| Request file | Copy fie to pret | Chose 


Screenshot 29 — Requesting aligned documents from Euramis to be sent by email or to the \pret dossier folder 
in Tradesk 


DGT-OMEGAT, ITS WIZARD WIZARD AND DGT’S CAT ENVIRONMENT — A TRANSLATOR’S GUIDE — MJM — June 2015 


C.2.4. Sending translated document memories to Euramis B® 


When you finish the translation of a document and send it to the requester — if you finished your translation in OT, 
revision included, if any — you should send the individual memory to Euramis with the correct Euramis attributes. 


For the moment, you must do it manually: 


1— Generate the memory(ies) to be sent to Euramis by clicking on Ctrl+ Shift+F8 (or Tools — Scripting — Create 
Euramis Export) and selecting the document(s) for which you want to generate memories. 


Those memories are saved in the \euramis subfolder of your project. 
2— Open the Euramis interface and select the individual memory(ies) to be sent to Euramis in the abovementioned 
subfolder. 


© To see to which Euramis Memory are your particular document(s) to be saved, if in doubt, you can — in Euramis 
— Translation Memory — Statistics & Managers — List — see Where to Save a document from a given DG, 
service or Cabinet 


Where to Save a document from a given DG, service or Cabireet 
Documents fram other institutions (codes CUCL, PL, COOL, COR and COCE) and SGVista documents (COM. SEC. C} must be saved to Curamis memory Undef, The managers will then transfer 
the data to the UtherLoc memary. 


Screenshot 30 — Present list of DGs, services and Cabinets with respective Euramis database 
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3— You will receive an email with the confirmation that the memory of your document(s) has been saved to 
Euramis... or you will receive an error message. In that latter case, check that you have selected the right Save 
to translation memory option and resubmit it. 


Save to Translation Memory 


Metts deer MACHADO @ Cootmuission (PROD) 


Tremstation Mettmery Search 
Home > Translation memary > fave to TH 

| me Ad files. | 

Files selected: (10/24) 2 
 RTD-2014-80062-00- 35 -PT-TRA-O0. xisx.trmx  RTO+2014-80062 00-32-PT-TRA-00.xisx.tmx 
® RTD-2014-80062-00-33-PT-TRA-O0,0CCX tnx ® RTO-2014-80062-00-34-PT-THA-00, DOCK. trex 
® RTD-2014-00062-00-36-PT-TRA-OD, xisx.tmx * RTD-2014-80062-00-39-PT-TRA-O9, xisx.tmx 
* RTD-2014-80062-00-39-PT-TRA-O0, DOCK. tenx  RTD-2014-80062-00-41-PT-TRA-00.xisx tmx 
 RTD-2014-00062-00-43-P7-TRA-O0. xi. tree ® RTD-2014-80062-0-45-PT-TRA-00..xlex.tmx 


Policies 5 
Submit Save settings help Rec0t | 


ian * Vn" anact* Awe» Mn beotie © Unt 


Screenshot 31 — Euramis Save to Translation Memory menu 


C.2.5. What is stored in Euramis 


In the file confirming the storing of your documents in Euramis, it is specified the number of segments kept. 


In fact, Euramis — besides cleaning all formatting information — also eliminates segments with identical source and 
target and some other segments as seen below in the example of an Euramis report. 


Save Report 


(oumnenacy) Cinueert statistics) 


Report on save to central memory Policies 
Job number: 34965 


Summary 


Pair [Read | tgnored | Rejected | Replaced] Added 


En-PT 1080 222 2 856 


Tepert statistics 


RTD-2013-00156-01-05-PT-TRA-00.DOCX.tmx 
RTD-2013-00183-01-02-0N-ORI-00, xtax.trnx | 
RTD-2013-00183-01-06-EN-ORI-00. xisx.trnx 
PETE 202 9-090 1 9-1 OF BN -ORI 00. tex ter | 


RTD-2013-00183-01-14-EN-ORI-00.xisx.tmx 
Total 


Source and target sentences are identical 
Sentence too short (ess than 3 alphabetical characters} | 93 
3 109 
Total 


Screenshot 32 — Euramis Save to Translation Memory report 
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C.3. Machine Translation = 


For Word documents in the language pairs EN —> other EU languages and some other language pairs, MT is 
automatically pre-processed for each document and is available in Tradesk and automatically copied to the \mt subfolder 
when you create or update a project with the DGT-OT Wizard. 


For other language pairs, you must request machine translation manually from the MT@EC service 


om . 
6 
. 
MT@EC - Machine Translation tog | Feemaa \ 
6 


“@ me ose eB 28 TS ve we 
u 


Cutpet format Sen Ta © 


Selection fies 


+ EMV QOL O07 KI-E-EN- ORD DOCK 68 1) 108% 


Screenshot 33 — MT@EC web interface 


To request machine translation: 


1— In Tradesk, do a local copy of the original documents if you haven't done it yet. 
2— Inthe MT@EC website, select the documents for which you want machine translation, the language pair and 
the output format. 


= Don't forget to select the output in tmx format. 
3—  Youwill receive the MT file by email. 
4— After creating the project, copy the tmx file(s) to the \mt subfolder of your project. 
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— PART D— 
MANAGING PROJECTS WITH 
AND WITHOUT THE DGT-OT 
WIZARD 
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D.1. Managing projects with the DGT-OT Wizard me 


OmegaT itself has basic management features. This is not a shortcoming; it is just the “philosophy” that project 
management is basically done via a File Manager — like Windows Explorer — that everybody already knows. 


In DGT some of the more frequent project management operations have been automated via the DGT-OT Project 
Wizard to interlink it with the DGT document management system (Tradesk). 

However, when you have confidential documents, you cannot use the DGT-OT Wizard and you will have to create the 
projects directly in OmegaT. “® Section D.3. 

With the new version of Omegar¥, it is already possible to directly access the OT project folder with the shortcut 


Ctrl+Shift+F1 (or via the menu Tools — Scripting) — which you can use instead of the Browse feature in Omegav. 
The rest of the process is, mutatis mutantis, similar to what is explained for managing projects with the DGT-OT Wizard. 


Nevertheless, using the DGT-OT Wizard is a “must” for: 
y_ The creation of projects with docx tagwiped documents, all the translation memories in Tradesk and an 
automatic extraction of IATE terms 
y The updating of projects with new documents/versions and/or new memories in Tradesk 
y The automatic backup of projects 
y Connecting to TeamBase, changing connection mode and disconnecting from it. 
™ Don't forget to always have the DGT-OT Wizard open and the project you are working on as the active project 


(selected in the Project field) to have automatic backups every 10 minutes to your space in the H:drive... except 
if you are working with confidential documents (SECEM)! 


D.1.1. General 


The ease, flexibility and user-friendliness of OmegaT concerning the creation and update of projects is of particular 
importance for DGT translators. 


For basic information on how to manage projects to start translating with DGT-OT right away, in the DGT-OT Wizard just 
click on the Quick Guide button. 


T-CNECT-2014-23-Gd-2015 


Screenshot 34 — DGT-OmegaT Project Wizard window and features 


ls 
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In the present Guide is presented detailed information on how you can optimise the management of DGT-OT projects, 
namely creating projects with a set of original documents, translation memories, machine translation output and/or 
glossary(ies); updating projects with new documents or new versions of documents already in the project, deleting 
documents/memories from the project, organising translation memories, archiving finalized projects, preparing projects 
for revision in OT and having (automatic) backups of your projects. 


It is also with the DGT-OT Wizard that you can easily connect to TeamBase to share project memories in real-time with 
colleagues who are working on the same project or on one or more related projects. 


The DGT-OT Wizard is easy to use as, for many of the project management operations, it just opens a Windows 
Explorer window and there you can copy, paste, delete, drag/drop and rename any files you want in the usual manner. 


As the DGT-OT Wizard and OmegaT do not “keep track” of all these operations, you are free to change your project as 
you want and when you reopen it, the DGT-OT and its Wizard will just accept what is there. 


To manage your project, you can click on the buttons or use the shortcuts (ALT + character underlined, for example 
ALT+A). 


If the first character of each option is not underlined, just click on Alt to activate the shortcuts. 


As OmegaT doesn't accept the Office 2003 formats, if you have an “old” document still in that format, either use the ORC 
document in Tradesk (if it is a Word document) or open that/those document(s) in their native application and do a Save 
as in the 2010 Office format (xlsx, pptx) before creating the project. 


™ Never rename the documents changing the doc, xls, ppt extension to docx, xlsx, pptx as if you do so the 
documents will not be converted to the Office 2010 format and will not be read by Omegat. 
In this section is given general information on how to manage DGT-OT projects. 


® Sections on Translation Memories and Machine Translation, Terminology and Revision for more detailed information. 


D.1.2. Features 
D.1.2.1. Guides 


You can access the following guides from the DGT-OmegaT Wizard: 
DGT Guides: 


y DGT-OmegaT 2014 and its Project Wizard — Quick Guide 


yY DGT-Omegar, its Project Wizard and DGT’s CAT Environment — A Translator’s Guide — 2014 (the Guide you 
are reading). 


Public OmegaT Guides (take into consideration that DGT-OmegaT has some adaptations) available in the public 


OmegaT website: 
y OmegaT 3.0 — User's Guide by Vito Smolej (the Guide in the OmegaT Help in pdf format for easier 
consultation) 


y Omegat for CAT Beginners by Susan Welsh & Marc Prior. 
D.1.2.2. News B® 


Information on updates — changes or improvements - in DGT-OT, its Wizard or TeamBase. 


D.1.2.3. Stats 


DGT-specific Excel sheet that enables you to record the statistical information and the progress in the translation of your 
project. 


La 
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D.1.2.4. Documents — Add and Remove 


By clicking on Add, you can select the document(s) you want the DGT-OT Wizard to create (or update) your project with. 
If more than one, you must select them one by one in the Windows Explorer window that pops up. If you want to change 
the documents added, you can remove one or more by highlighting its/their name and clicking on Remove. 


D.1.2.5. Projects — Select and browse 


To select a project already created, just click on Select and choose from the list displayed the project you want to make 
your active project. 


If you want to see what is inside a project — be it the original documents, the translation memories or any other folder — 
just click on Browse and it will open a Windows Explorer window in the folder of your current project. You can click on 
any of the subfolders to see their content and change it if you want. 


D.1.2.6. Projects — Clear 


Before creating a new project, click on Clear to be able to create a brand new project as otherwise the DGT-OT Wizard will 
update the active project defined in the field Project, if any. 


D.1.2.7. TagWipe 


More often than not documents have (many) tags that are useless (some call them the “tag soup”). 


A tag cleaning script — TagWipe — has been developed in-house for Word documents (the huge majority of our 
documents) so that you (mostly) only see the tags that are really necessary. 


If you work with tags in OmegaT (which is the default), keep the TagWipe option activated in the DGT-OT Wizard (which 
is the default too) as otherwise it is highly likely that you will see too many useless tags in OmegaT. 


As the DGT-OT Wizard will now create the project making TagWipe project-specific, if the IT Unit does any 
changes/improvements to TagWipe during the translation of a lengthy project with (many) new versions, there will be no 
change in those rules and therefore you won't have in your project unduly untranslated segments. 


If you create a project and there is an error message and the project doesn’t open, it may be due to TagWipe as 
the docx format is very complex — and “creative” — and rare tags can appear (mainly in poorly formatted 
documents) and create problems. 


~® See Part P on Troubleshooting to see how to solve the problem.... or ask for help! 
In the case of confidential documents — for which you cannot use the DGT-OT Wizard to create or update projects — 
use the TagWipe application available in your Desktop, identified with the icon 2. 


To do it, just open the \source folder of your project, highlight the document to be tagwiped and drag it to the TagWipe 
icon in your Desktop. Accept the cleaning of the document. 


D.1.2.8. IATE glossary &@ 


To take full advantage of the Glossary, Transtips and Auto-Completion features of OT, when you create a project with 
the DGT-OT Wizard, it will automatically copy to your project \glossary subfolder a read-only glossary with a filtered 
extraction from IATE containing the terms potentially relevant to your document(s). 


This file only contains the source term and the target term and OT displays in the Glossary pane all the entries for each 
segment. Furthermore, with the TransTips and Auto-Completion features you can have them displayed also in a 
dropdown menu in the Editor pane. 


This option is activated by default. You can untick it if you do not want — for some reason — to use an IATE extraction in 

your project. 

@ if you create a project with an IATE extraction and afterwards you change your mind and don’t want to use it, just 
Browse your project, select the \glossary subfolder and delete that glossary or move it to another location. 


i] 
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D.1.3. Creating a new project 


The starting point to create (or update) a project with one or several documents (with the same or different dossier 
numbers) is always to use the “Local copies” option in Tradesk to copy to your computer the original documents — in 
Office 2010 format — to be included in the project. 


By default, they are copied to the Local Documents — no backup — DGT — Dossiers folder. 


| State || Nows Other Guide} Guide | 
_C.\Users\macharne\AppOata\Local\Local Documents - no backup\DG T\Dossrers\CNEC T\CNECT 20 14-60023\CNECT 2014-80023-01 00 EN Of 
1c \Users\macharme\AppDate\Local\Local Documents » no backup\DOG T\Doxanrs\CNEC T\CNECT 2014-80023\CNEC T -2014-80023-01.-01-EN-OF 
} 


C\Users\machamelAppDats\L ocal\Local Documents « no backup\DGT\Dossers\CNEC T\CNECT -2014-80029\CNECT -2014-800273-01 02-EN- OF 


T-CNECT-2014-23-Gd-2015 


Screenshot 35 — Creating a project with the DGT-OT Wizard 


To create a new project: 
1— _ In Tradesk, do a Local copy of the original documents. 
2— Click onthe DGT-OT Wizard icon— —  —-in your desktop, if itis not open. If OmegaT is open, close it. 


3— _ If there is a previous project active (with its name in the Project field), click on Clear before creating the new 
project. 


4— Define the source and target languages (if necessary). 


™ Don't forget to check the language pair. If it is not the correct combination, the DGT-OT Wizard will not import to 
the project the retrievals and MT files as they will not be pre-processed for that other (incorrect) language 
combination! 


5— Click on Add — which will open a Windows Explorer window — to select the original document(s) previously 
copied to your computer. 


In multi-document projects, repeat the operation to add them one by one, even if they are in the same folder. If 
they are in different folders, navigate the folder structure to reach and select all the original documents. 


Click on Cancel or press ESC in the Windows Explorer window when you have finished selecting the documents. 
6— By default, the name of the new project is the name of the first document added and it is displayed in the Project 
field, but you can change it if you want. 


& In the project you can include all the documents you want regardless of having or not the same dossier number, 
therefore it might be useful to give it a meaningful name. 


i 


DGT-OMEGAT, ITS WIZARD WIZARD AND DGT’S CAT ENVIRONMENT — A TRANSLATOR’S GUIDE — MJM — June 2015 


7— Click on Create. 


The DGT-OT Wizard will create the DGT-OT project structure in the OmegaT_Projects folder — in System(C) 
— Users — {your login} — AppData — Local — DGT — importing the document(s) to be translated to the 
\source subfolder, the retrieval and alignment files to the \tm subfolder, the MT files, if any, to the \mt subfolder 
and the IATE extraction — by default — to the project \glossary subfolder. 


™ If there are any monolingual reference documents in Tradesk, these documents will not be automatically copied 
to the project. You will have to do it manually. 


8— — Click on Open when that button turns green (which means that the create operation is completed) and it will 
open your project in Omega’. 


9— _ Alist of the documents in the project is displayed. 
You can — at any time — change the order of the documents. 


By default, the documents are ordered alphanumerically. If you want you can, with the arrows Up, Down, Move 
First and Move Last, change that order. 


OT will create a folder in the \omegat subfolder with that information and it will “remember” it every time you 
open that project. 

& Very useful, when you have documents with different version numbers and you want to order the documents by 
(part) number ignoring the version number. 


If you have several documents and want to start translating another document first, click on the document you 
want to translate. You can always see the document list displayed again with the shortcut Ctri+L or by selecting 
Project Files in the Project menu. 


10— Close this window to start translating. OT displays the first segment of the first document for editing in the 
Editor pane. 


If you have already translated a part of the project, OT will open the project displaying for editing the last 
segment you edited in the prevision session. 


D.1.4. Translating in share mode with TeamBase B&@ 


You can work in share mode with other colleagues who are working on the same project — or related projects — either 
using Omega or the main CAT tool. 


®) Section D.2. for detailed information on TeamBase. 
To connect to a TeamBase memory: 


1— _ Inthe DGT-OT Wizard, select the project you previously created and with which you want to use TeamBase, if it is 
not the active project. 


2— _ Click on TeamBase in the DGT-OT Wizard. 


3— Select the memory you want from the list displayed in the left column or accept (changing the name or not) the 
TeamBase memory which is automatically suggested for creation. 


4— Select the mode in which you want to work: Read (receive) or Read/Write (receive/send). 
5— Click on Connect to Shared. 
6— You can start working in shared mode for that specific project. 


If you want to connect to more than one TeamBase, just repeat steps 3 and 5 as, in this case, you can only connect in 
Read mode. 
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D.1.5. Updating a project with new documents/versions 


Updating a project is very similar to creating a project: 


1— 
2— 
3— 


4— 
5— 


T= 
8 — 


In Tradesk, do a Local copy of the original documents. 
If OmegaT is open, close it. 


In the DGT-OT Wizard, select the project to be updated by clicking on Select, if it is not the active project, and 
choosing it from the list of your projects. 


Define the source and target languages (if necessary). 


Click on Add — which will open a Windows Explorer window — to select the original document(s) previously 
copied to your computer. 


In multi-document projects, repeat the operation to add them one by one. Click on Cancel or ESC in the Windows 
Explorer window when you have finished selecting the documents. 


If new version(s) of existing document(s) are added, click on Browse, select the project \source subfolder and 
delete the previous version(s) of the original document(s) that are no longer needed. 


The segments you already translated are stored in the project memory and are automatically inserted in the new 
version of your document(s) and displayed in the OT Editor pane without any other manipulations. 


Click on Update. 
When that button turns green, click on Open and it will open your project in Omega’. 


D.1.6. Pre-Translation (auto-populate) 


If you have one or more memories that you want to use for pre-translation, just: 


1— 
2— 
3— 


A 


Create (or update) your project as explained in the previous sections. 
In the DGT-OT Wizard, click on Browse to open that project folder. 


Copy/drag and drop (from another Windows Explorer window) the memory(ies) you want to use for 
pre-translation to the tm\auto subfolder. 


Open the project. 


100% match segments (tags included) will “auto-populate” your project memory and pre-translate untranslated 
segments in your documents. Those segments will be displayed in the Editor and, by default, are highlighted with an 
orange background to indicate that those are pre-translated segments. 


®) Section F.7. for more details on pre-translation using the \tmlauto subfolder. 


You can also pre-translate only a (filtered) part of your project. 


® Section F.8. for information on Search and Pre-Translate. 


™ When using Euramis memories for pre-translation, take into consideration that they may be memories of 


post-aligned documents and therefore they may contain misaligned segments! 


As the match rate is calculated between source segments, if there are misalignments in the Euramis memories, 
the target segments may not be “correct”! 
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D.1.7. Updating a project with new reference memories in Tradesk 


If you don’t want to add any new documents, but just add aligned reference documents (tmx files) that you have 
requested via Tradesk (Available legislation) or via Euramis (with copy to \pret) after creating the project, the process 
is basically the same as described in Section D.1.5. above. 


Based on the document(s) you Add, the DGT-OT Wizard will copy any new memories in \pret related to the defined 
document(s): 
1 — If Omegat is open, close it. 


2 — Select the project to be updated by clicking on Select, if it is not the active project, and check the source and 
target languages. 


3 — Click on Add — which will open a Windows Explorer window — to select the original document(s) that you had 
previously copied to your computer and which are related to the new aligned reference memories you want to copy 
from Tradesk to your project. 


& If the memories relate to documents which are part of a single dossier, you just have to add one of the documents 
in that dossier and the DGT-OT Wizard will copy all the new memories to the project. If it relates to different 
dossiers, select one document from each dossier. 


4 — Click on Cancel in Windows Explorer, or press Esc, when you have finished selecting the documents to which 
those memories refer to. 


5 — Click on Update. 
6 — Click on Open and the DGT-OT Wizard will open your project in Omega’. 


D.1.8. Updating a project with local memories 


1— There is no need to close Omega’ or the open project. 


2— Inthe DGT-OT Wizard, click on Browse. The DGT-OT Wizard will display the active project folder in Windows 
Explorer. 


3— Select the \tm subfolder 


4— Copy to it the memory files in your computer to be included in that project with copy/paste or drag&drop from 
another Windows Explorer window. 


5— You can resume your work if that project was open. If not, in the DGT-OT Wizard, click on Open. 


D.1.9. Updating a project with local original documents 


You can add documents that you have in your computer. In that case: 


1— Close Omegat. 
2— _ Inthe DGT-OT Wizard, click on Browse and a Windows Explorer window Is displayed. 
3— Copy — from another Windows Explorer window — the document(s) you want to the \source subfolder. 


™ Take into consideration that if you do not use the OmegaT Wizard (for confidential documents, for example) and 
you copy those documents directly to the \source subfolder of your project, if there are Word documents in your 
project, they will not be cleaned of useless tags (TagWipe is an operation that is performed by the DGT-OT 
Wizard, not by OmegaT). 


& If it is a Word document and it is a confidential project, do TagWipe manually using the TagWipe application 
available in your Desktop as explained in Section D.1.2.7. 


You can also work with Remove Tags on if your document has too many useless tags. 
If it is not a confidential project, you can update it via the Wizard and it will run TagWipe automatically. 
Just do Add with any document in the project and all documents will be tagwiped again. 

4— Click on Open to open the project in Omega’. 
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D.1.10. Adding glossaries to the project 


If you have personal/Unit/Department glossaries that you would like to use in your project, you can adapt them to be 
used in OT as read-only or as writable glossaries. 


® Part | for detailed information on Glossaries. 

To add one or more glossaries to your project: 

1— _ There is no need to close OmegaT or an open project. 
2— Click on Browse and select the \glossary subfolder. 
3— Copy/paste the glossary(ies) to that subfolder. 

4— Resume your work. 


If you want a glossary to be the writable glossary, the simplest thing to do is just to rename it glossary and that will be 
the glossary to where you will add terminology entries for that particular project. 


D.1.11. Adding monolingual reference documents to the project 


You may have in Tradesk — or available somewhere else — monolingual reference documents that you want to use in 
your project, like national legislation, standards, reports, etc., to search terminology using the Search Directory 
(Ctrl+Shift+K) feature in Omegat. 


You can use files in all the formats accepted by OmegaT, notably docx and pdf formats. 
® Chapter 9 of the Help in OmegaT for a list of accepted formats. 

To copy monolingual reference documents to your project: 

1— _ There is no need to close Omega’ or that project if it is open. 


2— Inthe DGT-OT Wizard, click on Browse. The DGT-OT Wizard will display the active project folder in a Windows 
Explorer window. 


3— Create a new subfolder that you name as you wish. 
4— Open that newly created folder and copy to it the monolingual file(s). 
5— You can resume your work if that project was open. If not, click on Open. 


D.1.12. Ranking memories individually or by subfolders 


You can have as many reference memories as you want as OmegaT can handle — without noticeably losing speed — 
many MB of memories in our service computers. 


Sometimes, to manage the “wealth” of information, it is worthwhile to give a priority — or a penalty — to (groups of) 
memories, and especially so in long and complex projects. 


® Section F.4. for detailed information. 

To organise your memories: 

1— You don’t need to close Omega? if your project is open. 

2 — Inthe DGT-OT Wizard, click on Browse and select the \tm subfolder of the active project you want to manage. 


There you have all the memories that were automatically copied to your project by the DGT-OT Wizard when the 
project was created or updated. 
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3— Rename those external memories — to give them priority individually (Example: 1-Financial-Regulation- 
32012R1268-EN-PT.tmx and so on) or create subfolders with the names you want — preceded by a number 
according to the priority you want to give them (Example: 1-Financial-Regulation, 2-Horizon2020) — and copy or 
drag&drop the relevant memories to those subfolders. 


4 — lf you have other local memories that you want to add to the project, just copy them (from another Windows 
Explorer window) to the \tm folder or to the subfolders you created. 


5 — You can resume your work if your project was open. If not, click on Open in the DGT-OT Wizard. 


D.1.13. Giving a penalty to memories 


Sometimes, it is useful to give priority to reliable translation memories over less reliable ones. 


Machine Translation doesn’t need to have a penalty because it is stored in a separate folder (\mt), is only displayed in its 
specific pane and is never mingled with human translations from Euramis. 


To give a penalty to one or more memories, you can create subfolders with names like "penalty-xxx" where xxx Is a 
number from 0 to 100. 


® Section F.5. for detailed information. 

To give a penalty to memories: 

1— You don't need to close OmegaT if you have your project open. 

2— Inthe DGT-OT Wizard, click on Browse and select the \tm subfolder of the active project. 

3— Create a subfolder and give it a name indicating the penalty you want to give (e.g. penalty-30). 
4— Copy the translation memory file(s) into it. 

5— You can resume your work if that project was open. If not, click on Open in the DGT-OT Wizard. 


D.1.14. Translating multilingual documents 


You can also translate multilingual documents in a single project without the need to divide the documents by source 
languages. However, the project memory — which will, by definition, be multilingual — should not be sent to Euramis. 


As these documents usually are not pre-processed, you will have to request Euramis retrievals and machine translation 
manually. 


If there are only a few segments in one of the languages, the simplest thing is to create the project with only retrievals 
and machine translation in the main language pair. If it is worthwhile, you can request it for both: 


1 — Doa local copy of the document to include in the project. 


2 — In Euramis — Retrievals, request retrievals — “@® Section C.2.2 — for both language pairs without dividing the 
documents, selecting the language pair and repeating the operation for the other language pair. 


Those retrievals will only have segments with matches... and there will be no matches for the wrong language pair. 
Choose the option of having the retrievals sent to you by email. 


3 — InMT@EC, request MT for each of the documents. “& Section C.3. 


4 — With the DGT-OT Wizard, create a new project in OmegaT defining one of the language pairs of the project (OT 
has no special code for multilingual documents). 


5 — Open the emails with the retrievals and machine translation files and copy them, respectively to the project \tm and 
\mt folders. 


6 — Open the project. 
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D.1.15. Translating with the help of a relay language (tmx2source) @ 


Anew Omegat feature is that it now allows the use of a reference translation memory with the source language identical 
to the source language of your project and the target language of the reference memory different from the target 
language of your project, displaying it in the Editor below the original source segment. 


®D Section F.6. for detailed information. 


You must copy that memory to a subfolder of the \tm folder named \tmx2source. This subfolder is not automatically 
created when the OmegaT project is created (either via the DGT-OT Wizard or directly in OT). You have to create it 
manually. 


To use this OT feature: 


1 — In Euramis, request an alignment of the document you want to have displayed as source (besides the real original) 
in the Editor pane. 


2 — Rename it with the target language code (Example: for an EN-PT project, if you want to use an EN-FR translation 
memory to see the French translation besides the EN original, rename the requested memory FR_FR). 


You can see a list of the language codes either in OT — under Help — or in the Project — Properties menu — 
Source File Language. 


3 — You don't need to close OmegaT if you have it open in that particular project. 
4 — Inthe DGT-OT Wizard, click on Browse and select the \tm subfolder of the active project. 
5 — Create a new subfolder — under the \tm folder — named tmx2source. 
6 — Copy the relevant translation memory file into it. 
™ You can only have one file in that subfolder. If you are using this feature for several documents in your project, 
either merge the tmx files or use one at a time for the particular document you are translating. 


7 — You can resume your work if that project was open. If not, click on Open in the DGT-OT Wizard. 


D.1.16. Giving absolute priority to an external memory (enforce) ® 


This is also a new OmegaT feature. You may want to give absolute priority to an external memory — even overriding the 
segments already translated and stored in your project memory. This may be the case, for instance, of a revised part of 
one or more documents. 


®) Section F.9. for detailed information. 


For that purpose, you can create a subfolder in the \tm folder — that is not automatically created when the OmegaT 
project is created (either via the DGT-OT Wizard or directly in OT) — named enforce and copy into it the memory(ies) 
you want to use: 


1—__ If OmegaT is open, close it. 
2— Inthe DGT-OT Wizard — with that project as the active project — click on Browse and select the project \tm 
folder 


3— Create a new subfolder named enforce. 
4— Copy to that subfolder the memory(ies) that you want to use for this purpose. 


5— Open the project. Segments that are 100% (including tags) in that/those memories will be transferred 
(auto-populated) to your project memory 

ual Afterwards don’t forget to delete that folder from your project unless you want that/those memories to keep 

overriding any changes you may make later to those segments! 
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D.1.17. Merging/splitting segments/paragraphs 

If may happen that you want to split or merge segments because you have a poorly formatted original — for instance 
with soft returns in the middle of a segment or with tables with segments split in 2 cells — or for any other reason. 

The easiest way is to: 


1— Open the original document(s) that you copied to the Dossiers folder (Local Documents — no backup — DGT — 
Dossiers) and make all the corrections you want (it will not affect the original in Tradesk, of course). 


2— Save and close the original document(s). 


3— _ Inthe DGT-OT Wizard, update the project with the corrected original(s) the usual way, by clicking on Add, selecting 
the corrected documents and clicking on Update. 


OT will display the original with the changes you introduced as it will segment and process the original again. 


& It has an advantage: you can even merge paragraphs if you want! 


D.1.18. Deleting memories from your project 


In the DGT-OT Wizard, the approach taken was that, in the huge majority of cases, you will want all the external memories 
available in Tradesk to be copied to your project and, as OT has no problems to deal with large amounts of tmx files, you are 
not prompted about what memories to import to the project when you create or update it. 


So, the DGT-OT Wizard just imports all the memory files in Tradesk \pret folder related to the relevant language pair. 
However, there may be projects in which you may prefer, for some reason, not to use certain memories. 

To easily delete memories from your project: 

1— You don't need to close Omegat if you have it open in that particular project. 

2— Inthe DGT-OT Wizard, click on Browse and select the \tm subfolder of the active project. 

3— Delete the tmx file(s) you don’t (any longer) want to have in the project. 

4— You can resume your work if that project was open. If not, click on Open in the DGT-OT Wizard. 


D.1.19. Sending translated documents to Tradesk 


As already explained in previous sections, if you want to copy the translated documents (even during the translation 
process) to make them available to everybody in Tradesk or to finalize them and send them to the requester: 


1— _ Check that you have no translated documents from that project open in their native applications. 
2— _ InOT, do Create (Current) Translated Document(s) (Ctri+(Shift+)D). 


3— _ In Tradesk, Upload the document(s) you want from the project \target subfolder (or from any other subfolder if 
you have copied it/them to another folder). 


D.1.20. Sending memories to Euramis ®&®@ 


As already explained in a previous section, if you want to send translated documents memories to Euramis: 


1— Generate the memory(ies) to be sent to Euramis by clicking on Ctri+Shift+F8 (or Tools — Scripting — Create 
Euramis Export) to generate individual memories of your documents with the attributes required by Euramis. 
Those memories are saved in the \euramis subfolder of your project. 


2— Open the Euramis interface and select the individual memory(ies) to be sent to Euramis in the abovementioned 
subfolder. 


ee To see to which Euramis Memory are your particular document(s) to be saved, when in doubt, see the Euramis 
— Statistics & Managers — List to see Where to Save a document from a given DG, service or Cabinet 


3— Youwill receive an email with the confirmation that the memory of your document(s) has been saved. 
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D.1.21. Preparing a project for revision in DGT-OT 


If your reviser wants to use OT to revise your document(s) — and if you have done all your translation work using OT, of 
course — you can do it but with some manipulations on your part as there is currently no automated revision workflow. 


In a nutshell: 
i. The translator prepares the project for revision and copies it to a server, 


ii. | The reviser copies to his/her computer the project for revision, does the revision work and finalizes the project if 
he/she has the last word or, if the translator has the last word, copies the revised project to the server, 


iii. If the translator has the last word, he/she copies to his/her computer the revised project, checks the changes 
made by the reviser — accepting them or not — and finalizes the project. 


® Section A.9. for the general revision workflow which will cover most of the normal work. 


® Part N for a detailed explanation of the revision process and variants for complex projects with more than one 
translator and/or reviser and/or with a number of new versions while the project is being revised. 


Here is only briefly explained the “manipulations” that have to be done to prepare a project for revision in 2 “standard” 
situations: sending the whole project for revision and sending only some documents for revision from a multi-document 
project. 


D.1.21.1. Complete project for revision 


In the more usual case of a single or multi-document project that is translated by 1 translator and revised by 1 reviser 
and all the documents in the project, either one or several, are sent at the same time for revision, the workflow is simple 
as explained in Section A.9. 


To prepare the project for revision, the translator: 
In Windows Explorer: 
1— Selects the OmegaT_Projects folder which is under C:\Users\{your login}\AppData\Local\DGT. 


2— Does a copy of the project to be sent for revision to that same folder by highlighting the project folder name and 
pressing Ctrl+C and Ctrl+V. 


3— Renames the new project folder thus created (which Windows Explorer automatically renamed 
{name-of-the-project} copy). 
© It can be renamed, for instance {name-of-the-project}-FOR-REVISION. 
This way, the original project will remain intact and it can always be reused. 
In the copy of the project for revision in Windows Explorer: 
4— Opens the project folder. 


5— Inthe \tm folder of the project, creates a new subfolder — for example called 0-DRAFT-translation — thereby 
giving it the maximum priority in the display in the Fuzzy Matches pane when the reviser does the revision work. 


6— Does a copy of the project memory (project_save.tmx file) in the project \omegat subfolder to this new 
subfolder. 
™ Don't delete the project memory from the \omegat subfolder. 


7—_ Copies the project to a location on a server that has been agreed with the reviser or which is the location used in 
Unit/Language Department to exchange projects (or copies it to a USB key). 


8— __ Informs the reviser that the project is ready for revision and indicates the location. 
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D.1.21.2. Partial project for revision 


In the case of a multi-document project that is only partially sent for revision, the translator may not want to send the 
whole project memory because one or more documents in the project may be only in a very “raw” phase and not ready 
for revision and its display in the Fuzzy Matches pane — when the reviser does his/her work — might be misleading. 


The translator can, if he/she wants, send only the memory(ies) of the documents that are ready for revision by extracting 
— from the whole project memory — only the segments from those documents. 


® Part N for a detailed explanation of the whole workflow and variants. 

To prepare the project for revision, the translator: 

In the original translation project in the OmegaT Projects folder: 

1— Closes any generated translated documents from that project that may be open in their native applications. 


2— Generates the individual translation memories to be used for revision by pressing Ctrl+Shift+F9 (or Tools — 
Create OmegaT Export) and selecting the document(s) to be sent for revision in the window that pops up. 


OT will generate memory files by document to the project \export-omegat folder. 


3—_ In Windows Explorer, does a copy of the project with Ctrl+C and Ctrl+V (to that same OmegaT_Projects folder) 
and renames it (for instance {name-of-the-project}-FOR-REVISION). 


This way, the original project will remain intact and can always be reused. 

In the copy of the project for revision: 

4— _ |n Windows Explorer, opens the project folder. 

5— Copies the memories generated in the \export-omegat folder to the \tmlauto folder. 


Optionally can rename it/them (for instance, {name-of-the-document}-DRAFT) so that there is no doubt that these 
are memories before revision. 


6— Opens the \omegat folder and deletes all the memories there. Usually there will be the project_save.tmx file and 
several backups (example: project_save.tmx.201503121120.bak or project_save.tmx.bak), leaving the other 
files in this folder as they are. 


11 — Copies the project for revision to a location on a server agreed with the reviser or which is the location used in the 
Unit/Language Department to exchange projects (or to a USB key). 


12 — Informs the reviser that the project is ready for revision and indicates the location. 


D.1.22. Backups & 


When a project is created or updated, an automatic backup is made to your H: drive (H:\CAT\OmegaT-Projects). This 
drive is itself automatically backed up by the IT Unit. 


Automatic backups: The DGT-OT Wizard updates automatically the backup of your active project every 10 minutes. 


In case of problems in your computer, the whole project backup folder can be copied to another computer or to your 
repaired computer and be used without any conversion. 


If it is copied to your service computer, copy the project to the OmegaT_Projects folder in your computer (after having 
reinstalled OmegaT if necessary). If not, you will not have the DGT-OT Wizard, but you can open the project directly in 
Omegav. 


Manual backups — By clicking on Backup in the DGT-OT Wizard, you can manually update the backup any time you 
want. 
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D.1.23. Retrieving files deleted from your project 


If you delete, by accident, any file in the OT project and you want to get it back, you can retrieve it from the backup in the 
H:drive. 


The backup that is done every 10 minutes only copies new files or modified files to the H:drive. It does not delete files 
even if they are deleted in the project folder in your computer. 


So, just open the copy of your project in the H:\CAT\OmegaT_Projects and recopy the files you want to your 
Ongoing project. 


Another way to retrieve deleted files is to Restore them from the Recycle Bin. 


D.1.24. Archiving memories of finalized translations 


Considering that Euramis strips all formatting from the segments it stores, it may be interesting to save — in a dedicated 
folder in your computer — all the memories of projects that you have finished with OT for later reuse. 


For example, if new versions of already released — and heavily formatted — documents arrive, it would be a waste of 
time to reinsert all the tags... which is what you will have to do if you retrieve that document's memory from Euramis... 
and you want to use OmegaT. 

baal Remember that in the present implementation of OmegaT in DGT, the documents are not converted to xliff 
format. 


When DGT-OT is installed, it is automatically created an empty folder in the OmegaT Projects folder named 
_PROJECT-MEMORIES to where you can copy the memories of the documents finalized with OT and sent to Euramis. 
Or you can copy them anywhere else you want, of course. 


e | suggest that you copy to this folder, at least, the memory(ies) you send to Euramis and, if you want to keep 
memories with notes and alternative translations, also the memories generated by the Create OmegaT Export 
script (Ctlr+Shift+F9). 


D.1.25. Archiving projects 


When DGT-OT is installed, an empty subfolder is automatically created in the OmegaT_Projects folder named 
_PROJECT-ARCHIVE to where you can copy the memories of the documents finalized with OT and sent to Euramis. Or 
you can copy them anywhere else you want, of course. 


1— Inthe DGT-OT Wizard, click on Select and the OmegaT_Projects folder is displayed. 
2— _ Drag and drop to the PROJECT-ARCHIVE subfolder the finalized project you want to transfer. 
& If later on you want to work on that project again — for example, because you have a new version of a document 


in that project and you want to use it — do the opposite, i.e., drag&drop that project from the 
_PROJECT-ARCHIVE subfolder to the OmegaT_Projects main folder. 


™ The active project has to be in the OmegaT_Projects folder so that you can add documents and update the 
project, have automatic backups, etc. 
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D.1.26. Deleting projects 

You may also want to delete from your local drive projects that you no longer need. 

1— Inthe DGT-OT Wizard, click on Select and the OmegaT_Projects folder is displayed. 

2— Highlight the project you want to delete and double click with the mouse right button and select Delete. 


™ If later on you want to work on that project again — for example, because you have a new version — there will 
still be a backup copy in the H:drive ... unless you deleted it too! 


D.1.27. Switching between DGT-OmegaT and another CAT tool and 
vice-versa 


OmegaT imports and exports memories in an open file format (tmx) that can be used in other CAT tools. 


Therefore you can start translating a document with OmegaT and if, for any reason, you prefer to continue translating it 
in another CAT tool, you can use the OmegaT memory of your project (and vice-versa) without losing the work already 
done. 


D.1.27.1. Switching between OmegaT and another CAT tool 


1— Press Ctrl+D to create your translated document to generate the level1, level2 and omegat memories in the 
project main folder. 


2— Import the level2.tmx file (with formatting) in the other CAT application. 


®@ You can also import the level1.tmx file (without formatting) if there are problems with the formatting. 


D.1.27.2. Switching between another CAT tool and OmegaT 


You can also do it the other way around: 

1— Export the memory in the other CAT tool to the standard tmx format. 

2— Copy it to the tmlauto folder of an OmegaT project you had already created. 
3— _ Inthe DGT-OT Wizard select that project and open it. 


The Editor window will display all the segments that exist in the memory you copied to the tmlauto folder that 
have 100% matches (formatting included). The others will be displayed in the Fuzzy Matches pane. 
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D.2. Memory sharing in real time — TeamBase me 


For projects that are translated by two or more translators — or revised by two or more revisers — it is important to be able 
to share the work that is being done in real time. 


With DGT-OT, the sharing is done using TeamBase, the DGT in-house application which is managed via the DGT-OT 
Wizard. 


With it, you can control when and how you want to work in share mode, depending on your work method and practices in 
your Unit/Language Department. 


® Sections 1.1.7. and F.10. if you also want to share glossaries and/or external translation memories with other 
translators/revisers. 


*¢ OmegaT Project Wizard (20141001) —— xa 


_ Shared memories aaa nanos Shared memories | own 


Read 


Screenshot 36 — TeamBase window 


A TeamBase memory is a bilingual memory that is created on a server and which receives — in real time — a copy of 
each translated segment validated by the translators connected to it in Read/Write mode. 
In the TeamBase window you have 3 columns which display: 

v_ All the TeamBase memories available for your language pair, in the left column. 


Here you can connect to a TeamBase in the desired connection mode (the mode displayed in the button) or 
change its connection mode and reconnect again; 


Vv The memory(ies), if any, to which you are connected, in the middle column. 


Here you can check (by highlighting the name of the memory) in what mode you are connected and it is also 
here that you disconnect from the highlighted memory; 


Vv The memories, if any, of which you are the owner, in the right column. 


™ When you close a particular project and later reopen it, you will remain connected or unconnected to TeamBase as 
you were before you closed the last session with that project. So check if you want to change anything! 


oe 
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You can connect to a TeamBase memory in Read mode only — in which case you receive segments from your colleagues, 
but you do not send your (validated) segments — or in Read/Write mode — in which case you receive segments from your 
colleagues connected to the same TeamBase in Read/Write mode and also send your segments. 


™ The selection you make will only affect the active project. Other projects that may be ongoing will remain connected 
(in either mode) or unconnected depending on the option you choose the last time you worked with it/them. 


You can share your translation — segment by segment — in real time from the very beginning by connecting to a TeamBase 
memory previously created for that particular project (or that you create yourself). 


You can also decide to only share your translated segments later when you consider that they are already sufficiently “good” 
to be of any use to others, while continuing to receive segments translated by others. 


© This feature is particularly useful if you first do a fast draft translation which you afterwards improve. 


Usually the “coordinator” of the project will create a TeamBase memory for your language pair to which others can connect 
to. That translator is considered the “owner” of that TeamBase memory and the only one who can delete it (besides the IT 
Unit, of course). 


The TeamBase memory is hosted in a server and it interferes in no way with (the memories of) your project which is in your 
computer. Each translator continues to have its own local project and it will remain “untouched”. 


You will only see — in the Fuzzy Matches pane — the segments coming from TeamBase identified with the usual attributes 
plus “TeamBase” at the beginning. 

| BwWoCM SOS AK AVI TTel4P RS rAO 
Mechns foratetert = © OC Fuh -6o 


A Conferéncia Mundial das Radiccomunicactes (WRC) ¢ o forum apropriado pare a revisho dcs | Reel DEVCO.2012-90005-EN-PT properties 20/05/15 
— de radiccomunica;ées (RR) que codificern cs espectos ransfrontetres Oa uMzaGéo do espetro £ 


Match: <100/100/100%> - Source: 
Pretesco OST TXGO Ty TOCTeTG Wh Seay eu TS er He Wes ~ [  <CNECT-2015-00030-01-00> ~ Transintor: <machame> 
The World Radiocommunication Conference (WRC) is the venue for revising the Radio Regulations (1) -> ORI DIFF: The Vioria Radiocommunication 
(RR) that codify cross-border aspects of the use of the radio spectrum, in particular by Getermining Conference (WRC) is the vente for revising the Racio 

which cadio services are allocated lo specific spectrum bands. Regulations (RR) that codify cross-border aspects of the 


| <segment 0004 ““TRA™ > use of the radio spectrum, in particular by determining 
TEST IESI TEST IESI TEST TESTIEST TEST TEST TEST which radio services are allocated to specific spectrum 
Send segment> bands. 
T™ TRA: TEST TEST TEST TEST TEST TESTTEST 
TEST TEST TEST 


Each WRC only considers a limited subset of the Radio Regulations, setting out the spectrum bands 
to be discussed and the scope of the possible outcomes based on an agenda decided af the - 


ea (ames Gy «fies tempt (TAruStCet 


Screenshot 37 — Matches from TeamBase identified in the Fuzzy Matches pane 


The TeamBase memory is created empty and will only have a copy — automatically done in real time — of the segments 
that you and other translators send for sharing by being connected to it in Read/Write mode. 


It is the validation of each segment in OmegaT (with Return) — either of a new translated segment or of a modified or 
unmodified one — that triggers the copy of that segment to the TeamBase memory (besides storing it locally in your project 
memory, of course). 

™ The segments you — or your colleagues — translate in Read mode or disconnected will not be sent to the 
TeamBase memory. 


If you are connected in Read/Write mode and you modify a segment that is already in the TeamBase memory, the previous 
translation will be replaced by the new one, i.e. the TeamBase memory only keeps the last translation of a unique segment 
with the same translator’s login. 


So, in the translation of a project, you can modify any segment several times and — in the TeamBase memory — there will 
only be your last translation of each unique segment, which will be displayed in the Fuzzy Matches pane in the other 
translators’ projects and in your own. 


You can connect to other TeamBase memories as there may be several projects ongoing that may be (distantly) related, but 
you can only connect to them in Read mode. Only the first selected memory can be in Read/Write mode. 


me 
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D.2.1. Connecting to an existing TeamBase memory 


To connect to an existing TeamBase memory: 


1 — In the DGT-OT Wizard, select the project you previously created and with which you want to use TeamBase, if it is 
not the active project. 


2 — Click on TeamBase. 

3 — Select the memory you want from the list displayed in the left column. 

4 — Select the mode in which you want to work: Read (receive) or Read/Write (receive/send). 
5 — Click on Connect to Shared 

6 — You can start working in shared mode for that specific project. 


If you want to connect to more than one TeamBase, just repeat steps 3 and 5 as, in this case, you can only connect in 
Read mode. 


D.2.2. Creating a new TeamBase memory 


To create a new TeamBase memory you have first to create the project using the DGT-OT Wizard, if you haven't done it 
yet. 


1 — In the DGT-OT Wizard, select the project with which you want to use TeamBase if it is not the active project. 


2— The name of your project will be automatically inserted in the field at the bottom of the Available Shared 
Memories column. 


If you want, you can change it to a name that is meaningful to you and your colleagues. 


3— To work on that project, select the mode in which you want to connect: Read (receive) or Read/Write (receive and 
send). 


The other translators may choose, at different times, to work in either mode. 
4 — Click on Connect to Shared 


5 — You can start working on your project. 


D.2.3. Changing the connection mode 


If you want to change from Read/Write to Read only — or vice-versa: 

1 — With the project in question selected in the DGT-OT Wizard, click on TeamBase 
2 — Inthe middle column, click on the name of the memory you are sharing 

3 — Click on the option displayed — Read/Write or Read — to change the mode. 

4 — Close the TeamBase window. 

5— You can resume your work. 


If you change from Read/Write to Read, you will no longer be sending the segments you translate from then on to the 
TeamBase memory, but you will continue to receive segments from the others. 


If you change from Read to Read/Write you will start sending the segments you validate after changing the connection 
mode. 


™ The segments you may have translated in Read mode or disconnected have not — and will not — be sent to the 


TeamBase memory. 


DGT-OMEGAT, ITS WIZARD WIZARD AND DGT’S CAT ENVIRONMENT — A TRANSLATOR’S GUIDE — MJM — June 2015 


D.2.4. Disconnecting from a TeamBase memory 


You can disconnect from a TeamBase memory at any time: 

1 — With the project selected in the DGT-OT Wizard, click on TeamBase 
2 — Select the name of the memory in the middle column 

3 — Click on Disconnect From Shared. 

You will no longer receive nor send segments from/to that memory. 


If you are connected to more than one TeamBase memory, the other(s) will remain connected. If you want to disconnect 
from all, you have to disconnect one by one. 


DGT-OMEGAT, ITS WIZARD WIZARD AND DGT’S CAT ENVIRONMENT — A TRANSLATOR’S GUIDE — MJM — June 2015 


D.3. Managing projects directly in Omegal - Translation 
of confidential documents 


When you create projects directly in Omega’, you will not have: 
1— Automatic backups to the H:drive, so take appropriate measures concerning backups. 


2— Your documents, if they are docx, will not be automatically wiped of useless tags and you will have to use the 
TagWipe available in your Desktop or work in OmegaT with the “Remove Tags” options (in the Project — 
Properties menu) activated if your document(s) have too many useless tags. 


3— You will not have automatically retrievals and machine translation copied to your project from Tradesk. You will 
have to do it manually. 


You can create the project in your computer or in a USB key, depending on the security instructions. 


D.3.1. Creating a project directly in OmegaT 


1— In Tradesk, copy the originals to your computer. They will be, as usual, copied to the Dossiers folder 
(C:\Users\{your login}\AppData\Local\Local Documents — no backup\DGT\Dossiers) 


2— Inthe DGT-OT Wizard, open any project to open OmegaT as in DGT the access to Omegat is done via the 
DGT-OT Wizard. 


3— Close the DGT-OT Wizard so that it does not do automatic backups to a server. 
4— _ |nOmegar, close the project (Project — Close). 


5— Create a folder where you want to create your OT project or use the OmegaT_Projects folder (or a subfolder of 
it) in your computer. 


6— Inthe Project menu (or with the shortcut or the icon), click on New 


Project] Edit GoTo View Tools Options Help 
Mm New Ctrl+Shift+N 
Team Project 


Open Ctri+O 


al 


Copy Files to Source Folder... 
Download MediaWiki Page... 


& Reload F5 
@ Close Ctrl+Shift+W 
Save Ctrl+S 
Create Transtated Documents Ctrl+D 
View source file Ctrl+H 
View target file Ctri+G 
Create Current Translated Document Ctrl+Shift+D 
Properties... Cirl+E 
Project Files... Ctri+L 
Quit Ctri+Q 
Screenshot 38 — Project menu 
7— In the Windows Explorer window that pops up, select — in the field Save in — the folder in which you want to 


create your project and — in the field Folder name — the name you want to give to your new project. 
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8— Inthe Create New Project window, select the source and target languages of your project. 


It is here that you can also change the location of your project at any time. But for the moment accept the 
defaults. 


[MD Create New Project =r = — =" cae § es a een oe + beabSe | 


Please select the language of your source files and the target 
language you would lke to translate to. 
Here you may also specify custom location of project folders. 
Languages 
Source Files Language: Source Language Tokenizer: Source Tokenizer Dehavier: 
y , 
en-gb ~ LucenelinglishTokenizer 
Translated Files Language: Target Language Tokenizer: Target Tokenizer Behavior: 
| pt-pt ~ | LucensPortugueseTokenizer ~)* Lucene current 


Options 
|¥) Enable Sentence-level Segmenting Segmentation... | 


\V) Auto- propagation of Translations 
__ Remove Tags 


File locations 
Source Files Folder; 
C: \Usars\machame\AppData\|. ocal\DGT\OmeqatT_Projects\ TEST \source\, 
Translation Memory Folder: 
Cr\Users\machame\AppData\_ocal\DGT \Omegat _Projects\ TEST \tm\, 
| Glossary Polder: 
c: \Users\macharne\AppData\Local\DGT\OmegaT_Projects\TEST\glossary\ 
| Writeable Glossary File: 
C:\Users\macharme\AppData\Local\DGT \OmegaT_Projects\TEST \glossary\glossary.txt 
| Dectionary Folder: 
C; \Users\machame\AppOata\Local\DGT \Omegal_Projects\TEST \dichonary\ 
Transiated Files Folder: 
| C:\Users\machame\AppData\Local\DGT \Omegat _Projects\TEST \target\ 


Screenshot 39 — Edit Properties menu 


9— In the Project Files window that is displayed (empty), click on Copy Files to Source Folder, select the folder 
where you have the original document(s) and select the documents you want to use in your project. 


ue) Project Files (15) — ie = 


| rename beer Preotng taerher of Segments — taarniter of yeuncpum Gang, 


PT -TRA-OO. docx Macrosot Open 34 Opecon t 2a, Pres 
RTD 201 9-R0070'01:00-PT*TRA-OO.docx Mxrosall Open XMi Opencim | 2, 12, 
RYO 2013 66521 00-01 PT YRA 60. docx Microsoft Open XML Openxmy 181) 140 
2 D-701 p-00G27-00-00-"T-1Ka-00 Gece smcrosnh Open xy Upencny, 1. biw 
RTO 2913-00022 00°60) PT TRA 00.doce Miacrosc#t Open XML Open, i tis 652, 
MTD 2012-00922-00-00-FT THA 00Meex Microson Open XML “Openme | 1,542) 77 
(ETO- 201 3-80625-00-01-FT-THA-00 choc Mtr erent) Oren 30H Oper } 1 my 
RTD- 2917 -80024.00-00-PT-TRA-00.decx Micropatt Open xo Oper, | 4,099, aie 
Set-3019-90024-60-01.°T TRAO0 deck Micsosefl Oper Xi Open ii 1 oo 
WYD-2013:00025 00-00 FY TIA G0.decx “Microsolt Open KML OpenkMt | 1,403) 
IU I01 F-BUC27-O1-Gi-Ft-1KA-OU. Goce Mmcrosam Open xa Upenmy = sas | Mowe Fest 
(RTD 201.9 80026-00-00-PT°TRA 00 doce Mazosclt Oper Xi. JOperce ii $164) 320; —____— 
MTD 201 CO026-00-01-FT THA OU.decx Mecroson Open ML Open | 1 220 | Move. Up 
HTD-201 ¥-ROS??-00-00-FT-TRA-00 Gare Macrosel Opec XE Openome i tae t2e 
RTO 7012 00027-00-01 PT TRA 00.dece Microsoft Open Xi. LOpersne, t 4,434, ASS peeve Down 

| bere Sensi 

Total oumber of segments } i i P97 
Number of ureque ose 
Traratend ue secrets a 


Yeu 600 fied deteded stotistx informeben in the file: 
Cryweere\machame \App0sta\Localoc t \omegat _Projects\K! 0-204 J-FPP-Gunde 2014 \omega#tiproject_state tt 


Copy Mies to Source Heider... || Downioad Megane Mage. || Choe 


Screenshot 40 — Project Files menu 
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10 — OT displays the list of documents in your project. Click on Close to accept the opening of the first segment of the 
(first) document. You are now ready to start the translation of your project. 


If it is a multi-document project and you want to translate another document, just do Ctrl+L (or select Project 
Files in the Project menu) and the list will be displayed again and you can select the document you want by 
double-clicking on it. 


You can also change here the order in which OT displays the documents in your project by highlighting the name 
of a document and clicking on Move First, Move Up, Move Down or Move Last. 


11 — If you have external memories, in Windows Explorer copy them to the project \tm subfolder. 
12 — If you have machine translation files, create a \mt subfolder (not automatically created) and copy those files to it. 


D.3.2. Managing a project directly in OmegaT 


To update or modify a project directly in Omega’, it is, mutatis mutandis, as explained in Section D.1, but without the 
help of the DGT-OT Wizard to interface with Tradesk and so you will have to do all the operations manually. 


You can access the open project via the Tools menu, under Scripting — Project Folder or with the shortcut 
Ctrl+Shift+F1 and manage your project directly via Windows Explorer. 
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D.4. Project Statistics me 


In the Tools menu, you have, among other things: Statistics, Match Statistics and Match Statistics per File. 
These statistics are stored in the project \omegat subfolder and are replaced every time you run the statistics again. 
e Getting Statistics is a fast process, but Match Statistics (per File) can take a while if you have a really big 


project of hundreds of pages. The good thing is that, by default, you can do it any time (during a coffee or lunch 
break) and not necessarily when you create or update a project! 


Tools Options Help 


Validate Tags Ctrl+Shift+V 
Validate Tags for Current Document 

Statistics 

Match Statistics 


Match Statistics per File 


Screenshot 41 — Statistics in the Tools menu 


As OT will replace the statistics files when you run statistics again, if you want to keep a record of the progress in the 
translation of your project, you can access the DGT-specific OT_Stats Excel sheet via the DGT-OT Wizard by clicking on 


the Stats button. Mt 


DP i you wart to know how much you have translated dunng an hour/day'week, you can 

8) Before starting translating the project, paste - in Initial Statistics below - the values you obtained (and copied) with the Statistics feature in the OmegaT Tools menu 

b) AL any time during the transtation of your project, paste in Ongoing Statistics the values obtained with the same feature and afterwards copyipaste the value in cell M15 to cet B18 and downward: 
P You can also copy'paste to Match Statistics the values obtained wth the Match Statistics features in OmegaT Tools menu for future reference 


3 £ if yf 
tease bk 


Pages translated - Copy ne value trom cet M13 to cet B18 and downwards 

You wif have - in the Unique PAGES translated colurnn - the number of pages (1500 

characters without spaces) you translated since the beginning or between datesthours Match Statistics - Paste Omega Match Statistics below, starting at 
ince H17 yo aning pages to translate cea Ji7 


0 Good Matches 

0 interesting Matches 
0 Poorer Matches 

O No Matches 

0 Total 


Screenshot 42 — DGT-specific OT_Stats Excel sheet 
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D.4.1. Real time statistics 


Furthermore, the statistics of the segments translated and to be translated (unique and non-unique) are displayed in real 
time at the bottom-right of your OT window so that you always know how advanced you are in your work, in terms of 
segments. 


15/615 (14/551, 615) | [123/146 


15/615 Number of segments — translated vs. total for the current file 

14/551 Number of unique segments — translated vs. total in the project 
615 Total number of segments (including repeats) in the project 

123/146 Number of source and target characters in the current segment 


Screenshot 43 — Real-time statistics displayed at the bottom-right of the OT window 


™ Take into consideration that, if you have many segments you want to keep in the original language and you don’t 
translate them, they will be counted as untranslated in the statistics. So you may have a nice surprise at the end! 


D.4.2. Statistics 


In the Tools menu you can select Statistics to see the statistics concerning the documents in your project by segments, 
words and characters — unique and non-unique — with the indication of the segments already translated and to be 
translated. 


& | suggest also that you copy this initial statistics (highlighted in blue below) to the OT_Stats template 
automatically copied to your project by the DGT-OT Wizard (“® Section D.4.5. below). 


D Statics - , . a 


Screenshot 44 — Project statistics 
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D.4.3. Match Statistics 


Also in the Tools menu you can select Match Statistics in order to have statistics for your documents in terms of match 
percentages with the external translations memories in your project, and your project memory if you already started your 
translation, also by segments, words and characters (with and without spaces). 


& Again | suggest that you also copy these initial statistics (highlighted in green below) to the OT_Stats Excel sheet. 


| §2 Match Statistics 


Segments Words Characters (without spaces) Characters (including spaces) 


Hepetilions: 


Exact match: 


741843 


Screenshot 45 — Match statistics 


These statistics give you a (pretty good) idea of the translation work involved. Take into consideration that they are not 
“perfect” and that may be somewhat different from those obtained in other CAT tools. 


& | suggest that you run Match Statistics before starting the project, so that you can have the statistics exclusively 
from the external memories, i.e. without any of your work. 


In fact, when producing the match statistics, OT will take into consideration all the memories: the external memories in the 
\tm folder (and subfolders, if any) and the project_save memory (your translated segments). 


So if, in the middle of the translation process, you run Match Statistics again you will have a mixture of matches from the 
external memories and your project memory. 


According to the public OmegaT Help: 


y Repetitions stand for identical segments present several times in the text. The first segment and its 
contents will be classified as "no match", and the rest of them as a repetition of the first. 


y If the translation for several identical source segments already exists in the translation memory of the 
project, these segments, together with other, already translated unique segments, will be classified as an 
"Exact match", 


y The number of unique segments, if needed, is provided in the standard statistics window, regardless of 
whether they have been translated or not. 


y The rest of the categories (50-100%) involves untranslated segments with a fuzzy match. 


y Fuzzy matches can come from the /tm folder and from the internal translation memory in Jomegat folder, 
as is the case for repetitions and exact matches. The only difference with matches from the project_save 
translation memory is that external TMs cannot give exact matches, only 100%. 


y Spaces between segments are not taken into account in the last column. 
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D.4.4. Match Statistics per File @ 


This is a new OT feature that is very useful to detect the documents in a project which have (many) repetitions, if any, 
within the documents in the project. 


These statistics give you a detailed view, per document in the project, of how much new text you will have to translate as 
work progresses. It is also very useful to distribute work if there are several translators (and/or revisers) involved in a 


project. 


Match per File | & 
File 1: RTD-2013-86020-060-00-PT-TRA-00.docx ed 
Segments Words Characters (without spaces) Characters (including spaces} 

Repetitions within this file: 4 6 18 9 
Repetitions from other files: ] 0 0 0 
Exact match: t 22 129 iso 65 
95%-1008: 17 44 230 248 
858-948: 18 130 139 
75$-84%: 34 239 260 
$9$-74%: 43 $71 3399 3915 
|tio match: 3557 20072 23460 

}) Total: 237 4252 24217 28191 
|\File 2; RTD-2013-60020-01-00-PT-TRA-00.docx 

i Segments words Characters (without spaces) Characters {including spaces) 

|| Rapetitions within this file: a) 0 0 0 

|| Repetitions from other files: 215 3789 21464 24967 

| |(@xact match: i 22 129 150 
953-1008: 7] 0 0 0 
85$-34%: 0 e ° is) 
7$%-84%: 3 0 0 0 
508-749: 0 0 0 0 
No match: 14 420 2402 2808 
Total: 239 4201 23998 27928 

|\rtne 3: 'D~2013~80021-00-01~-PT-TRA-00. docx 

|| Seqments Words Characters (without spaces) Characters {including spaces) 
Repetitions within this file: 29 42 206 208 

\ Repetitions from other files: 4 + 8 3 

|| 2xact match: 1 4 30 33 
95%-100%: 31 104 617 687 

t 854-944: t $ 33 37 
||759%- 844%; ) Q Q q 
|'508-748: 23 395 2365 2751 
No match: 86 1941 11439 3306 
Total: 161 2495 14718 7030 

| 

File 4: RTD-2013~80022-00-00-Pt-TRA-00.docx 

Segments Words Characters (without spaces) Characters (including spaces) ~~ 


Screenshot 46 — Match Statistics per File 
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D.4.5. OT_Stats — Excel sheet template ae 


You have automatically copied to your project main folder an Excel sheet template — OT_Stats — in which you can 
record manually your progress in the translation of your project, just by copying/pasting the data in the statistics given by 
OT as shown above. The results are converted to pages (1 page = 1500 characters without spaces). 


You can easily open this Excel sheet from the DGT-OT Wizard or open it from your project folder (Ctrl+Shift+F1) in 
Omegat. 


Df you wert to know how much you have translated dunng an houridey’week you can 

3} Before starting translating the proect paste . in Indfial Statistics below . the values you obtained (and copied) wih the Statistics feature n the Omegal Tools menu 

b) AL any time dunng he Pansiaton of your proect paste in Ongoling Statistics the values cbtaned wth the same feature act aterwards copy/paste the vatue m cel M13 to cet B18 and Gownwards 
DP You can also copyipaste  Mateh Statistics the values ottamed with the Match Scatistics features in OrnegaT Tools menu for ‘uture reference 


Tctal | T 


Pages translated . Copy t= valus trom cel M13 to cet B1B and dowrmards You wilhae -n 

the Unique PAGES transtated coum - the quenber of pages (1500 characters without spaces) you aes 

translated since the beginning or between dstesihours In cet H17 you wil have the rumber of remaining Match Statistics - Paste Omecat Match Statisics bel, starting st 
cel S17 


oct 


swos] Chrs wi sp 


Fagen’ 5 ni) 
96%. 100% 4424] 34308) sei aia 1 _| 
185%-94% 6] 13584) 13598 87628) 49 | 18! Good Matches 
pees | oe 21] 2t Interesting Matches 
pes | a, _34__| 3t Poorer Matches 
46 | = 46 No Matches 
| 340 | 340 Total 


Screenshot 47 — OT_Stats excel sheet (example) 


In this Excel sheet, you can record statistics in terms of words, segments, characters (characters with and without 
spaces) and pages (1500 characters without spaces): 


1— Copy the Statistics results shown in Screenshot 44 marked with a light blue line to the corresponding cells in the 
Excel sheet above also marked with a blue line to record the initial statistics before starting translating a project. 


2— Copy the Match Statistics results shown in Screenshot 45 marked with a green line to the corresponding cells in 
the Excel sheet above also marked with a green line to have the initial Match Statistics before starting 
translating a project. 


3—_ As your work progresses, you can select again the option Statistics and copy the results to Latest Statistics in 
the Excel sheet marked with a dark blue line above to see how much you have progressed. 


4— Furthermore, you can copy the cell marked in violet to the Pages translated section if you want to have an idea 
of how much time it is taking you translate that project by hour/day/week etc. 


5— You can repeat points 3 and 4 replacing the Latest Statistics as many times as you want. 
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D.5. Editing Machine Translation memories a 


For a medium/large project, if you are using machine translation, you may have repeatedly inconsistent terminology that 
you may prefer to correct in the machine translation file so that it already proposes segments with these inconsistencies 
corrected. 


The most practical way is to use the Euramis Alignment Editor that you can access just by searching for it in the DGT 
Applications. 


1— Open the Euramis Alignment Editor 
2— Select File + Open 


3— The OmegaT projects folder is: C:\Users\{user login}\AppData\Locall\DGT\OmegaT_Projects. Select the 
project you want, its \mt subfolder and the MT file to be changed. 


4— Use the Search — Search/Replace feature to change the terms/strings you want. 
5— Save the changed file. 


Pu Euramis Alignment Edinor « C\Users\nachame\AppOata\L ocal GT\Omega}l_Prorects\ T/ ELARG- 20) 4-89031-B0034 eat ELARG 2014-80031 -00-00-EN-ORD 00 EN-PT MT timo o & & 


Pet ae, ee ce Oe it wee eee eee Ca! ek 


Translation sentence: ese 


Replace | Merge Split | GoTo | 


Opboss Commiss< 18/11, 


Screenshot 48 — Changing MT output in Euramis Alignment Editor 


— PART E— 
DGT-OT MENUS EXPLAINED 
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E.1. General 


Depending on the way you prefer to work, you can use the Menus to perform all operations or use (Some) shortcuts or 
icons. In Part Q you will find a list of all the shortcuts (both from the public version and DGT-specific). You can change 
them... if you are an advanced user. 


In this Section are briefly explained the menus available in DGT-OT so that you can have an overview of all the options 
available and quickly see the ones that are of interest to you. 


As some of the most important features have options in several menus and submenus, they are explained in detail in 
separate thematic sections of this Guide. The most important menus for general daily work are highlighted in green 
below 


& You can print the ones more important to you until you get familiarised with DGT-OT features. 
OmegaT has 7 main menus sometimes with submenus. Each menu/submenu allows carrying out several operations: 


Y Project: 
Vv Properties 
f§ Segmentation 
& File Filters 
Vv Project Files 
y Edit: 
Vv Switch Case To 
v_ Select Match 
y GoTo 
y View 
Vv Modification Info 
y Tools 
y Options: 
v_ Machine Translation 
v_ Glossary 
Vv Transtips 
Vv Auto-completion 
§ Glossary 
(§ Auto-text 
f§ Character table 
Font 
File filters 
Segmentation 
Spell Checking 
Editing Behaviour 
Tag Validation 
Team 
External TMXs 
View 
Saving and Output 
Proxy Login 
Restore Main Window 
Language Checker 
y Hel 
User’s Manual 
About 
Last Changes 
Log 


<<< <G << << <<< <<< <<< 
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E.2. Project menu a&e 


This menu allows, among other things, to create, open and update projects, change the order in which the original 
documents are displayed and change project settings. 


For some of these features — besides the shortcut — there is also a DGT icon. 


Projet Et GoTo View Tools Onions Hel 


Bt New. Clie 
Download Team Project, 


To open a project. In DGT, the access to OT is done via the DGT-OT Wizard. 
BD Oven., (ti+0 


It is recommended to always open projects via the DGT-OT Wizard to have 
automatic backups every 10 m. Confidential documents are the exception. 

Copy Files to Source Folder. 

[vsvsesnocr Lvl ect ae. 


Reload when new source files are added to the project or source files are 8 


deleted. OT also prompts you if you want to reload when you change your 
To close the project without closing OT. g (lose CtrleShitteW 


preferences 
Manual save. By default, OT does automatic backups every 3 minutes to et 
the \omegat subfolder. This has nothing to do with the backups the ave (tl +§ 


DGT-OT Wizard does of the whole project every 10 minutes to the H:drive. 
To generate all the (finished or incompletely) translated documents in the 

Cede Tate Dumas 
opps - original (source) document you are translating in View source file Ctrl +H 


MEO open the target document you are translating in its native \i { tfi Ctl ( 
application. Be sure that there are no translated documents open in their leW alge | é + 
native applications. If there are, OT will get stuck! 


B to generate only the (finished or incompletely) translated document Create Curent Translated Document Ctrl+Shit+D 


you are working on in its native application. The same applies: you cannot 
Properties. (trite 


have translated documents open in their native applications. 
Project File, CtreL 


Quit (tl+Q 


To create a new OT project. This is usually done via the DGT-OT Wizard to 
benefit from the integration with Tradesk. 


To add original documents to the project. It is recommended to do it via the 
DGT-OT Wizard to have automatic cleaning of useless tags (TagWipe) and 
an IATE extraction. 


Project submenu that allows changing the project settings 
D Section E.2.2 below. 


Project submenu listing all the files in the project. It allows changing the 
order of display of the source documents in the Editor. 


> Section E.2.1 below. 


To close the project and OT. 
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E.2.1. Project submenu: Project Files 


After creating a project with the DGT-OT Wizard, the list of documents in the project is displayed. Here you have 
information about the documents in the project: name of the documents, filters used, encoding and some statistical 


information. 


In this menu, you can also change the order in which the documents are displayed for translation in the Editor. 


If you have several documents in your project and you want to start translating a document which is not the first one 
displayed in the list, just click on the number/name of the document you want and OmegaT will display it in the Editor 


pane and open its first segment. 


If you change your mind, just press Ctri+L (or in the menu Project select Project Files) and you will see this list again 


and you can select another document or change the order again. 


If you had already translated a part of your project, when you open it OT will “remember” the last segment you edited 


and will open it (instead of the first segment of the project). 


Technical information: filters 


Number of files in your project : 
and encoding 


Number of total segments (repetitions included) by document 


Number of unique segments by document — 
non-unique (repeated) segments excluded 


= 


2S) Project Files “ 

een Fitter 
RTD-2013-80920-00-60-PT-TRA-GO,doce Microsolt Open XML OpenXMt | 
RTD-2013-80020-01-00-PT-TRA-00.doce Microsoft Opes ML (OpenXML 
RTD-201}-80021-00-01-PT-TRA-00.dece ‘Microsoft Open XaML ‘Openet 
KTD-2013-80022-00-00-FT-TRA-G0.docx (Microsoft Open XL OpenXMe 
RTD-2013-80022-00-01-FT-TRA-OO, docx Macros? Open OM. Opens, 
RTD 2013-800235'00-00-FT-TRAO0.doce Microsoft Open XML Openkee 
RTO-2013-80023-00-O1-PT-TRA-O0.docx (Microsoft Open XM ‘OpenxM. 
(RTO-2012:80024/00-00-F'T TRA G0. doce Microsoft Open XML pero 
RT0-2913-80024-00-G1-FT-TRA-00,doce Microsoft Open Xi Operxe 
RTD 2013°90025-00-00-"T -THRACO.docx jMicrosalt Open 201 Opera. 
RTO-2013-60025-01-O1-PT-TRA-G0,doce Microsoft Open XML OpenxML 
IT0-2013:00026-00-00-FT-TRA-CO.docx [Microsoft Open xsi Opencoe 
RTO-2013-80026-00-O1-PT-TRA-G0.doce Microsoft Open XML pene 
RYO-201 3 8062? 00-00-FT- TRA-O0. doce (Microsoft Open 2st ‘Openxh 
RTD-2013-80027-00-O1-PT-TRA-@0.docx Microsoft Open XML Opera 


Documents in your project. By default, documents are displayed 
by alphanumeric order. 


You can fied detaded statist: eformaton in the tite: 
Pilea armas irmacrcesceat as ote ccompailr er AR ene ca cine ron 


To copy originals to the project \source folder. 

In DGT, this is usually done via the DGT-OT 
Wizard which automates the process importing 
translation memories and an IATE extraction and 
cleaning superfluous tags (TagWipe) in docx 
documents. 


@ To define the order in which 
documents are displayed in the 
Editor, highlight the name of a 
document and move it up, down, 
first or last in the list 


Snapshot statistics by segments: 
total and unique segments and 
already translated segments 
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E.2.2. Project submenu: Properties — Edit Project 


In this menu you can define/change some project settings. In general, you will not have to define any as the DGT-OT 
Wizard creates the project with adequate tokenizers, segmentation and file filters. 


Concerning segmentation, it is by default done at segment-level, but you may want to change it to paragraph-level in a 
particular project. However, take into consideration that segmentation rules in Euramis and Machine Translation are also 
at segment-level and therefore you will probably not have many/any matches from them. 


If you want, you can also change the segmentation rules themselves. The changes made in this menu are 
project-specific. By default, segmentation rules are always project-specific when a project is created via the OT Wizard. 


You can also change here the location of your projects folders, namely the glossaries and external memories location. In 
DGT, the DGT-OT Wizard creates the new projects in the OmegaT_Projects folder and it is not recommended to 
change the project structure... unless you know what you are doing. 


Languages: Defined by the DGT-OT Wizard when creating a new 
project. 


® Technical setting: Tokenizer 


By default, segmentation is at segment-level (as in mci i é 


Euramis and MT). You can change it to paragraph-level by — aad 5 | ir | 
unticking the box here. It will only apply to this project. ea : erg 
Trarateted Fites Language: seen _ 


ee —— LucenePor tuguese Toker ner * Lucene current 


as Enable Seonencetevel Seomentng ® Section E.2.3. | ® section E23.below. Ly =a 
I Auto-cropagation of Translations ® Section E.2.4. below. 


[Remove Tags 
Externa! Post orocessing Conmand (Diestied): 


You can translate a project without any inline tags by | 


activating Remove Tags. It is not recommended to do so 
after starting the translation of a documentiproject. Me locators 


Source Fes Folder: fromse_| 
i wale nema seaiviatnaac 7 


ZL IMEGAT PROX TOSSUT-20 14 RID-208 3-2) 14-GRANTS GG 2- 16-1 PROD-TW Om igossary | 


Auto-propagation can be deactivated here. You can 
change this setting at any time in the translation of a 
project, but segments already propagated will remain so. 


Here you can change the location of the writable glossary QOL IPEGAT PROR TOS T-2014)R1D-2013-20 14-GRANTS GIG 2- 1S LUIS ROD- TW Om \hossery lossary. txt 

file. It must be the same as the location of the Glossary Cictonary Folder: Sromse | 

folder. sPorbcteps\cataiisomegstikctoweserew, 
Trendeted Ples Folder: Gromse | 


=: ZO TOMEGAT FRO IOS CUT-2L4RID-20 23-20 14GRANTS GIG 2- 156" LS -PROD- TW OM ltrpet|, 


Here you can change the location of the dictionaries, if 
any. 


_% | _ cn | 


About stemmers. Here is some basic information taken from the OmegaT Help: 

Tokenizers (or stemmers) improve the quality of matches by recognizing inflected words in source and translation 
memory data. They also improve glossary matching. 

A stemmer for English, for example, should identify the string "cats" (and possibly “catlike", "catty" etc.) as based on the 
root "cat", and "stemmer", "stemming", "stemmed" as based on "stem". A stemming algorithm reduces the words 
"fishing", "fished", "fish", and "fisher" to the root word, "fish". This is especially useful in case of languages that use 
pre- and postfix forms for the stem words. 
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E.2.3. Project submenu: Project Specific Segmentation Setup 


Again, in general, you will not have to change anything in this menu as the DGT-OT Wizard creates the project with the 
adequate segmentation rules for the language pair of your project. 


Segmentation is always a problem although the segmentation rules have been greatly improved to match, as much as 


possible, Euramis and Machine Translation rules. But they are not perfect ... namely with poorly formatted original 
documents. 


The IT Unit tries to improve segmentation rules for the 24 official languages and therefore there are sometimes changes. 


As projects can be updated with new versions or documents, in DGT segmentation rules are, by default, project-specific 
so that, when you update a project you will not have — as orphans/untranslated — segments that you had translated 
before ... but which are considered untranslated due to different segmentation rules. 


If you want you can change those rules in this menu and they will be project-specific. To change the segmentation rules 
for all your projects, “2 Options —- Segmentation menu. 


® Chapter 14 on Segmentation of the public OmegaT Help/Guide for more information as this feature is not covered in 
this Guide. 


™ Don’t make changes in this menu ... unless you are an advanced user and know what you are doing! 


¥]é Project Specific Segmentation Setup xi 
Sets of segmentation rules: 
Note: All of the segmentation rule sets with a matching Language Pattern are applied in the given 
order. 

Thus, for example, rules for Canadian French (FR-CA) should be higher than rules for French (FR.*), 
and higher than Default (.*) ones. Then while translating from Canadian French your project will use all 
the rules defined for all the language chain in the correct order. 


JV Make the segmentation rules project specific 


atalan = Move Down | 


; Segmentation rules are applied in the following order: 


Add 


Remove | 
Move Up | 


Move Down 


[yee 
a 
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E.2.4. Project submenu: Project Specific File Filters 


Again, in general, you will not have to change anything in this menu as the DGT-OT Wizard creates the project with the 
adequate file filters for the documents in your project and so this feature is not covered in this Guide. 


However, here is some basic information taken from the OmegaT Help: 


Omega? has highly customizable file filters, which enable you to configure numerous aspects. 


File filters are pieces of code capable of: 


a) 
b) 
c) 


Reading the document in some specific file format. For instance, plain text files; 
Extracting the translatable content out of the file and 


Automating modifications of the translated document file names by replacing translatable contents with its 
translation. 


In this menu, you can change the file filters for a particular project. 


But you can also change the file filters for all your projects. “& menu Options — File Filters 


@® For information on this subject, see Chapter 7 in the OT Help/Guide. 


™ Don’t make changes in this menu ... unless you are an advanced user and you know what you are doing! 


aye Project Specific File Filters nn x] 


View or edit file filters. To edit what files in what encodings the filter will process, select the filter from the list 
and dick Edit. If the filter has any options, you may change them by dicking Options. 


T_ Make the file filter settings project specific 

I¥ Remove leading and trailing teas 

[I¥ Remove leading and trailing whitespace in non-seamented projects 
[— Preserve spaces for all tags 


J[™ Ianore file context when identifying seaments with alternate tanslations 


Iv 
Iv 
Iv 
Iv 
Iv 
Iv 
Iv 
iv 
Iv 
ial 
Iv 
Iv 
Iv 
ial 
Iv 
Iv 
Iv 
Iv 
Iv 
Iv 
Iv 
Iv 
Iv 
Iv 
Iv 
Iv 
Iv 
Iv 
Iv 
ire 


Restore Defeults | [ox | Cancel | 


a 
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E.3. Edit menu se 


This is a very important menu for your daily work! In this menu you have a list — with shortcuts — of OT features for 
editing. 


® Sections on Translation Memories and Machine Translation, Tags, Glossary, Search, Search/Replace, Pre-translate 
and Auto-propagation for more detailed information. 


Besides these shortcuts, the Windows keyboard functions/shortcuts select all / copy/paste, cut-paste and drag and drop 
work within and between OT windows (Fuzzy Matches, Search and Notes windows) and also from and into external 
applications (Google, Euramis, Eur-Lex, etc.). For some operations, you can also use the DGT icons. 


Edit| Go To View Tools Options Help 
Se — 
*& Undo Last Action Ctrl+Z 
In the segment open in the Editor, to replace the text by the segment in 
bold in the Fuzzy Match pane (by default the 1st match) 


“Redo Last Acton ce 


— the segment in bold in the Fuzzy Match pane 
4 Insert Match Ctri+] 


Seo ee EMee Replace with Machine Translation Ctrl+M 


het Sou Chis 
Insert Missing Source Tags Ctrl+Shift+T 


@ To insert the next missing tag in the target segment 


_ Insert Next Missing Tag Ctri+T 


To export the current selection to a text file for processing 


Sl sean 
a 
Create Glossary Entry Ctrl+Shift+G 


‘\ Search Project... Ctri+F 


. Submenu to select the 
Switch Case To Fuzzy Match segment to 


Lower Case 


vac eeee. replacelbe inserted in the 
Cycle Shift+F3 Select Match target segment in the 


Translation that will be auto-propagated to all non-unique segments. Use as Default Translation 
Translation that will not be auto-propagated to the other non-unique Create Alternative Translation 
segments. 
ayer Remove translation 
Deletes all the text in the target segment open in the Editor. Equivalent to 
1+X. 5 
sii Set empty translation 


Deletes all the text in the target segment open in the Editor AND that . 
Registe 


Editor 


Select Previous Match Ctri+Up 
Select Next Match Ctri+Down 
Select Match #1 Ctrlel 
Select Match #2 Ctrl+2 
Select Match #3 Ctrl+3 
Select Match #4 Cute4 
Select Match #5 Ctrl+S 


segmentiline will be empty in the translated document in its native 
application. 


dentical Translation  Ctrl+Shift+S 


@ If Allow translation to be equal to source is not active in the 
Editing Behaviour menu, it allows — for the specific segment 


open in the Editor — to have identical source and target 
segments. 
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E.3.1. Edit menu — Search Project window B® 


This is a very important window that is activated by pressing Ctri+F or selecting that option in the Edit menu. 
The Search (concordance) feature in the public OT is very complete and allows to search by a series of criteria. 
In DGT, this feature has been further improved with new features very useful for translators. 


® Part G for detailed information on the many Search options. 


A part of this menu (in green below) can be hidden so that you have more space for the results ... if you don’t need 
those options. This applies also to the Search Directory, Search/Replace and Search and Translate windows. 


You can also customize the Attributes of the searched segments displayed in the Search window (your preferences), 
for example to have displayed - in the Fuzzy Matches pane - together with the source and target segments, the notes 
you inserted in the segments you translate. 


You can change them any way you want by clicking on Configure format and choosing the variables — in the Match 
Display Template — deleting, inserting and ordering them the way you prefer. You can easily change it any time you 
want. 


®) Part O for detailed information on how to customize the Attributes. 


_ Same tex in ab fields AND @ OR 


= ey 
w Remove duplicates Translated — Uvrensisted © Translated or urtransiated 

Search scope 

Source files Marmory VMs | Gowseries File or fichier name: * | Mengrae 
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E.3.2. Edit menu — Search/Replace window B&@ 


This is a very important window that is activated by pressing Ctrl+K or selecting that option in the Edit menu. 


The Search/Replace feature in OT is very complete and allows to search/replace by a number of settings. It has similar 
(but not as many) options as the Search feature. 


In DGT, this feature has been substantially improved. 


A part of this menu (in green below) can be hidden so that you have more space for the results ... if you don’t need 
those options. 


® Section G.3 for detailed information on the Search/Replace options. 


® Part O for information on how to configure the Attributes display. 
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E.3.3. Edit menu — Search Directory window & 


This is a very interesting feature that is activated by pressing Ctri+Shift+K or selecting that option in the Edit menu. 
It allows to search terminology/phraseology in monolingual reference documents in the project, if any. 


The Search Directory feature in OT is very complete and allows searching monolingual documents by a number of 
settings. It has similar (but not as many) options as the Search feature. In DGT-Omegaf¥, this feature has been 


improved. 
A part of this menu (in green below) can be hidden so that you have more space for the results ... if you don’t need 
those options. 


® Section G.4. for detailed information on the Search Directory options. 


® Part O for information on how to configure the Attributes display. 


&) Test Search 


=» —- — 
Test ts search 
earch for 7 Mamenze 
Expeemon mode Word mode — 
Gext seerth «= Heyerced search Regaer expreswons I ings Whole words) Lertena: 
7 Search options r 
¥ Remove duplicates 
Sanaa » 
Loreen — 
Locate Select Folder oy Returvve march 
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E.3.4. Edit menu — Search and Pre-Translate window B&@ 


This is a completely new feature in DGT-OmegaT which allows searching by certain criteria and afterwards 
pre-translating the selected segments. It is activated by pressing Ctrl+Shift+Z or selecting that option in the Edit menu. 


It can be used to search, for instance segments only with numbers and to copy, in a batch operation, the source to the 
target segments. It also allows to pre-translate (from external memories or from machine translation) the selected 


segments. 


A part of this menu (in green below) can be hidden so that you have more space for the results ... if you don’t need 
those options. 


® Section F.8. for detailed information on the Search and Pre-translate options. 


@® Part O for information on how to configure the Attributes display. 


Search and Pre-transiate a %& 
Search fo: +) Memorize 
Treralete es: @ Source — Match Mine! score: & Moxhing trorslotion Prefix: 
Tec gular 2x; 
Depresace mde - Word mode — — 
Sect search 9 Keyword search Regier eqressins Case senstne | Sings Whole words @ Lemmas 


Transated @ Votransiated — Translated or urtransiated 
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E.4. Go to menu se 


This is a very important menu for your daily work! To move around in the OT Editor is easy and fast even with large 
projects. 


To open a segment anywhere in your document just double click anywhere on it with the mouse left button. To go to the 
next segment, press Enter or click on icon 10, which will save the translation of the currently open segment to your 
project memory and open the next segment. 


If you want to skip segments leaving them untranslated and continuing with a segment far below, just scroll to the 
segment you want and double click on it. OT will save the segment you were in and open that other segment. 


If your project has more than one document and you are in the last segment of your first document, when you press 
Enter it will open the first segment of the second document. 


To move around the Editor you can use the shortcuts, the icons (“@® Section A.7) or click on the Go To menu and 


chose an option. 
View Tools Options Help 


Nevt Untranslated Segment Ctrl+{) 

bs] It validates the segment you are in and opens the next translated 
segment. It is useful if you want to check your translated segments while : 
skipping segments that you want to remain untranslated. Next Translated Segment Cirle Spite 
The same as Enter. It validates the segment you are in and opens the next | 
segment for editing. = Next Seqm ent (tr +N 

= Previous Seament (trl+P 
know its number), just write the number of the segment in the field displayed = Segment Number... Ctr +| 

and it will save the segment you are in and open the selected segment. 


If you have segments with notes (by default highlighted in pink), you jump to = Next Note Cirl+Snitt+A 


them successively opening them for editing. 
To delete a note, just delete all the text (including spaces) in the Note pane 


BM ir escacisnt tronmuail 7” Previous Note Ctrl+Shiti+8 

Nevt Revised segment —  Ctri+Snitt+X 
p Previous Revised segment Ctrl+Shitt+Y 
Forward in History Ctrl+Shit+N 
Back in History Ctrl+Shitt+? 


It validates the segment you are in and opens the next untranslated segment. 
It is especially useful if you have a new version of a document. 


\y 


I 


If you want to go to a particular segment in any of your documents (and you 


om You can go forward or backward opening segments which were 
translated or modified by another user (different login). You also have 
shortcuts. 

This is used in the revision process when the translator finalizes the 
translation by accepting or rejecting the changes made by the reviser. Those 
segments are highlighted with a red background if the option Mark Revised 
Segments — in the View menu — is activated. Track-changes are 
automatically displayed in the target segments in the Fuzzy Matches pane. 


You can go forward or back opening segments by the order you edited them 
in that session with that project. 
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E.5. View menu se 


This is a very important menu for your daily work! 


Source and target texts are displayed vertically one on top of the other. This applies to the Editor and Fuzzy Matches 


panes and to the Search windows. 


OT preferences are 1-level and OT will “remember” the last preferences you set when you close and reopen it, either 
with the same project or with another project. Of course, you can easily change the preferences at any time. 


You can start translating with the default settin 
preferences. 


gs, but you may also want to change some of them to suit your 


There are several options concerning the display of the segments in your text. You can choose at different stages of your 
translation to have them displayed differently just by clicking or unclicking that option in the View menu. When the option 
is active, the coloured box is marked with a line around it. 


If translated, target segments are always displayed. 
Furthermore, translated segments will be highlighted 
with a green background if this option is selected. 


Untranslated source segments are always displayed. 
They are highlighted with a yellow background if this 
option is selected. 


Source segments of translated segments are displayed 
together with the target segments if this option is ticked. 
They are displayed with a blue background (default). 


Segments repeated in the whole project (100% identical 


including tags) are displayed greyed (non-unique 
segments (default) 


Segments with notes are highlighted with a_ pink 
background when closed if this option is ticked (default). 


Smet Segments created/modified by another user 


(different login) are displayed with a red background if 
this option is ticked. Used in the revision process. 


Non-breakable spaces are displayed greyed (default). 
White spaces are displayed with a dot, if this option is 
ticked. 


@ Useful for Right to Left languages, which is not the 
case of the EU official languages. 

@ To mark segments pre-translated from an external 
memory present in the \tmlauto subfolder (default) 


Mark Translated Segments 

Mark Untranslated Segments 

Display Source Segments 
Mark Non-Unique Segments 

Mark Segments with Notes 
Mark Revised Segments 

Mark Non-breakable Spaces 
Mark Whitespace 
Hl Mark Bidirectional Algorithm Control Characters 
Mark Auto-Populated Segments 
Modification Info 


By default, the segment open in the Editor displays its 
ID — login, date and hour — if already translated in the 
project. You can also see no identification at all, or have 
all segments identified in the Editor by selecting it in 


Display None 
Display for Current Segment 
Display for All Segments 


this submenu. 
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E.6. Tools menu se 


This is a very important menu for your daily work! 


The Tools menu — and related submenus and options — are explained in detail in the thematic Sections on Tags, 
Statistics, Terminology, Spellchecking, Quality Check and project management. 


To check and correct tags in whole the project. T . 
[rcinicnsiornrneoos _] Fag Options Hel 
alate Tag Csi 


Validate Tags for Current Document Ctrl+Shift+) 


To generate Statistics of the whole project and by document (by 
words/ segments/characters, translated/untranslated) 


Statistics 
To generate Match Statistics for the whole project (which include 
Ee ee Statistics 
@ To generate Match Statistics for each document in the whole ict i 
| 


Scripting... 

1 - Open Project Folder Ctrl+Shift+F1 
2 - Open Glossary Ctrl+Shift+F2 
siebelebillelies 3 - QA - Check Rules Ctrl+Shift+F3 


| 
identical 4 - Show Same Segments Ctrl+Shift+F4 


a ane 
you are working on. 


fea | ae 
—— 9 - Create OmegaT Export Ctrl+Shift+F9 


@ BB to generate translation memories ocument to be sent * ° ° 
for an . to be ste _ . 10 ° Write Query Notes to File Ctrl+Shift+F10 
1] - <none> Ctrl+Shift+Fld 


bs] A quick way to open in Windows Explorer — via OT — the 
folder of the project you are translating 


@ To open the writable glossary in Notepad++ for editing 


@ To do a Quality Check for the whole project or the document 


2 To extract a list of selected Notes from the project in html 


= 12 - <none> Ctrl+Shift+F12 


To generate a | Togeneratealistof untransiatedsegments = LL of untranslated segments 
[tecrwsestetmmetsenns a ualty check 
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E.6.1. Tools submenu — Scripting @ 


Scripting is a new feature in OmegaT which allows to easily integrate features developed by the open-source community 
around OT. 


Scripts Folder: C:\Pgm\DGTapps\CAT2014-dev\Omega?T \scripts 


Check Same Segment 
Currency Translator urpose: Expert sour and translation segments of user selected 
Example - GUI Scripting files into TMX-file 
Example - Key Binding * #Files: For each source file, writes a file with same name with tmx extension added 
Example ~ Modify Segment Sas re a 
Example - Search and Replace #Devails: http: //wp.me/p3fHEs-6¢ 
mOpen Current File 
Kos Ivantsov 


2013-08-12 


QA ~ Identical Segments 
SVN - Cleanup javax.swin 
SVN - Commit Source File : org.cmegat 
org.cmegat 

Amport org.omegat.util. 

amport static javax.swing.JOptionPane.* 

import static org.cmegat.util.Piatform.* 
create_euramis_export-antigo.gro 
reate_oure xport def prop = project.projectProperties 


eate eu Do: v 
create_omegat_export.groovy |Jae (!prop) ¢ 
toolbar.groovy C a 
iwrite_new_trans2TMX.groovy 
write_new_trans2TMX_GULgroovy 
write_notes.groovy 
write_queries.groovy 
write_selection2list.groovy 


< 


Script “write_queries.groovy* bound to siot #10. 
Script “write_queries.groovy” unbound from slot #10. 
Script “write_queries.qroovy” bound to siot #10. 


mt 


[ <6>.] <7>. | {_<s>_] [<9>] [, <10>.} | 


Some are already included in the public OT standard version, others which are available as open-source software have 
been integrated in DGT implementation of OT. Just use the relevant shortcut or choose the desired option in the Tools 
menu. 


® in the previous Section E.6. the scripts already available via the Tools menu. 


In this menu, it is possible to define up to 12 scripts (features). In DGT-OT, 10 are, by default, available as shown in the 
previous section and further explained in the relevant thematic sections of this Guide. 


If you want, you can easily choose from the list 2 other scripts not already defined, by: 
1 — Highlighting the name of the feature in the left column, 

2 — Right clicking on one of the buttons below: "<11>" and "<12>", 

3 — Selecting Add Script. 


The purpose of each script is, in general, briefly explained by its author. Each script is associated to a shortcut as shown 
in the previous Section. 


You can also customize the scripts you want to use by adding and deleting them to the scripting list. 


The options to Add and Remove a script will not delete it from your OT installation. It will just display it or not in the 
Tools menu. 


Some may not be straightforward. Ask for help if you need support! 


© If you have the know-how, you can even add your own scripts! 


® Appendix F — Scripting plugin in the OT Help/Guide for information on scripting and scripting languages. 


1 
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E.7. Options menu 


In this menu, you can define a series of parameters. Some of the options — and related submenus — are further 
explained in the relevant Sections on Translation Memories and Machine Translation, Terminology, Auto-Completion, 
Spellchecking and Quality Assurance. 


Options} Help 


~ Use TAB to Advance £4 The Glossary option in this menu provides 


access to Terminology as a Service. TaaS is the 
result of a research project financed under the 7th 


| Always Confirm Quit Framework Programme on RTD (http:/www.taas- 
project.eu/). For the moment it is not available in 
[ Machine Translate 


DGT. 
Glossary 


By default, to validate a segment 
and open the next segment, you 
press Enter. If you want you can 
change it and use the TAB to do this 
operation by activating this option. 


By default, when you press Quit, OT 
closes down without asking for 
confirmation. If you want, by 
checking this option, you will be 
asked if you really want to quit and 


you have the option to Cancel. ; ; 
Terminology entries from 


u =) the project glossary(ies) 
v Enable TransTips are displayed in the Editor 


fuzzy or exact match). 
Ls] In DGT, the only option is the M ‘ St -| ee feature can tf 
in-house MT@EC system. In the TransTips deactivated here. 


public OmegaT, there are several 
online MT systems available. 


Auto-completion 


Glossary... «-] To configure 
Auto-text... Auto-completion 


Character Table... 


Y/ 


®D section £.7.1 below. Font... 
To define the font and size of OT main window text 


File Filters... 
(Editor, Fuzzy Matches, Glossary) and Search and Tag 


Segmentation... validation wide 
Spell Checking... a 
Editing Behaviour.. * 

Tag Validation... 


Dp Section E.7.2 below. 


| 


I 


D Section E.7.3 below 


Sample Text 
This is the way text will look 
when displayed. 


D> Section E.7.4 below 


\ 


@®D section £.7.5 below. 1Y| Apply this fort to the Project Files window 


| 


= leam.. 
Your login (automatically defined) 
External TMXs... 
To define source and non-unique segments display. 
®D section £.7.6 below. View... © View Options hy | 


V) Dieplay ail source segments in bold 
IV) Indude the first noe-unique segment when mariang nor-unique segments 


To define how frequent oT saves Sse Saving and Output... 
Proxy Login... 
Restore Main Window ~ 

¥_ Language Checker 


your project to the project \omegat 
subfolder in your computer. 


ee 00 


(a) (ca 


Not used in DGT. 


If you have made changes to the window display and 
want to get back to the default display, just click on this 
option. 


To check potential language and 
grammar issues, highlighting 
those words with a_ blue 
underline. You can deactivate it 
here 
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E.7.1. Options submenu — File Filters 


In DGT, we have the IT Unit which takes care of these technical aspects. Therefore this feature is not covered in this Guide. 


However, an important option is the Ignore file context when identifying segments with alternative translations. 
When ticked (check it), alternative translations will be auto-propagated in all the files in the project as OT will ignore the 
document identification. If not ticked, alternative translations will only be auto-propagated in segments within the 
document associated with that alternative translation. 


™ These settings apply to all the OT projects. So don’t change them unless you know what you are doing. 


®) Section 7 in the public Guide/Help for more information. 


a ie 

View or edit file filters. To edit what files in what encodings the filter will process, select the 
filter from the list and click Edit. If the filter has any options, you may change them by clicking 
Options. 

¥| Remove leading and trailing tags 


¥ | Remove leading and trailing whitespace in non-segmented projects 


Preserve 


Ignore file context when identifying segments with alternate translations 


File Format Edit... 


XLIFF 

Text 

Android Resources 
Windows Resources 
Mozilla Lang 

Infix 

Typo3 110nmgr 

Help & Manual 
QuarkxPress CopyFlow Gold 
Magento CE Locale CSV 
PDF Input 

Key=Value Text 

XHTML 

SVG Image and Adobe Illustrator Ex... 
DocBook 

Camtasia for Windows 
HTML and XHTML 
SubRip Subtitles 
Mozilla DTD 

Typo3 LocManager 
HTML Help Compiler 

PO 

Wordfast TXML 

ResX 

Flash XML Export 
Microsoft Open XML 
DokuWiki 

Visio 

WiX Localization 

LaTex 

Java(TM) Resource Bundles 
OpenDocument 
CWCMS (Documentum) 
LgTranslation 

IMI 


Options... 


ISISISISISISISISISISISISISISISISISISISISISISISISISISISISISISISISI SSIS 
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E.7.2. Options submenu — Segmentation 


In DGT, we have the IT Unit which takes care of these technical aspects. Therefore this feature is not covered in this Guide. 
™ These settings apply to all the OT projects. So don’t change them unless you know what you are doing. 


® Chapter 14 of the public Help/Guide for more information. 


4 Segmentation Setup |X 
Sets of segmentation rules: 
Note: All of the segmentation rule sets with a matching Language Pattern are applied in 
the given order. 
Thus, for example, rules for Canadian French (FR-CA) should be higher than rules for 
French (FR.*), and higher than Default (.*) ones. Then while translating from Canadian 
French your project will use all the rules defined for all the language chain in the correct 
order. 
Language Name Language Pattern 
EN |[eE][nN].* | Remove 
FR [fF][rR].* Ic) 
BG [bB][gG].* | Move Up 
DE [dD ][eE].* | 
cs [cC][sS].* Move Down 
DA [[dD][aAl.* ia 
Segmentation rules are applied in the following order: 
Add 
Remove 
Move Up 
Move Down 


Restore beaut 
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E.7.3. Options submenu — Spellchecker Setup 


In DGT, we have the IT Unit which takes care of these technical aspects. Therefore, this feature is not covered in this Guide. 


® Chapter 22 of the public Help/Guide for more information. 


|V| Automatically check the spelling of text 


Dictionary file folder: 


|C:\Pgm\DGTapps\CAT2014\OmegaT\SPELLERS\Both 


Dictionaries already installed: 


bg_BG - Bulgarian (Bulgaria) 
cs_CZ - Czech (Czech Republic) 
da_DK - Danish (Denmark) 
de_DE_frami - German (Germany) 
el_GR - Greek (Greece) 

| |en_GB - English (United Kingdom) 
| URL of online dictionaries: 


http://download.services. openoffice.org/files/contrib/dictionaries/ 


Install new dictionary... 
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E.7.4. Options submenu — Editing Behaviour Options Be 


In this menu you have several important options: you can select the fuzzy match minimum rate, the automatic insertion 
(or not) of machine translation, to copy source to target and some other features. 


® Sections on Translation Memories and Machine Translation, Tags and Pre-Translation for more information 


With this option is unticked (default), the source text cle Editing Behaviour Options Xl 


is never inserted in the target segment open for 
editing. 


If this option is ticked, in the Editor, the source text Plaase select what text you would like t0 he inserted into the segment that ig not translated 


will be automatically inserted in the open target 
segment IF: a) Insert Machine Translation is ‘ 
deactivated and b) Insert best fuzzy match is yet, when you move to it 
activated but there is no match with the defined 


minimal similarity. () The source text () Leave the segment empty 


RT MT is automatically inserted if there is no fuzzy 
match/match with the minimal similarity threshold 
(Default). 


Insert Machine Translation If there is no match from translation memories — and 


eventually no MT if so defined — the segment is left 
empty. If you validate the segment without translating 
it, the original text will be taken in the translated 


By default, the best match from your translation Insert the best fuzzy match document in its native application IF Allow translation 
memories will be automatically inserted if within the to be equal to source is ticked (default). 


defined threshold. You can deactivate automatic Zz I 
insertion here. aa . ah r 
Nina iy ee 
Fuzzy matches threshold for 


By default, when there is a match in which numbers automatic insertion in the target 


are different (for instance dates, percentages) OT will segment in the Editor. 
not try to change them. You can activate it here, but Prefix: ' 


take into consideration that OT sometimes fails. 


‘ z You can add a prefix that will be 

Attempt to convert numbers when inserting a fuzzy match J CiSsE NESS Seni ey 
each automatically inserted 
fuzzy match in the Editor (Ex: 

fuzzy). If used, those entries 


F Allow translation to be equal to source can be filtered. In DGT-OT, by 


default there is no prefix. 


In DGT, by default OT accepts equal text in source 
and target (which are saved in the project memory). 
If you untick this option, the option Insert the source 
text will not work. 


[al 


OT exports data from within the current OmegaT 
project to plain text files for further processing. 


@rhis feature is important when you have a project Export the seqment to text files 
in which there are (many) non-unique segments ad 

which have a different translation. If you tick this 
option, OT will stop (and open) segments with 
multiple translations for you to check. 


| Go To Next Untranslated Segment stops when there is at least one alternative translation 


2 By default, tags are protected and can only be 
added or deleted. Here you can allow tags to be 
editable if you want to change them manually. 


\__ Allow tag editing 
You can have OT warning you — when you press 


Enter — that there are tags missing/misplaced in the ia Validate {ags when leaving a segment 


edited segment. You can therefore validate tags 
segment by segment if you tick this option. 


‘Save auto-populated status 


2 Very important when you use Pre-Translation or 
for the revision process. Segments pre-translated 
from the \tmlauto subfolder will be highlighted in 
orange in the Editor (default) — and will remain so 
unless they are changed — if the option Mark Auto- 
Populated Segments is activated (the default) in the 
View menu. 


tal 
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E.7.5. Options submenu — Tag Validation Options 


In DGT, we have the IT Unit which takes care of these technical aspects. Therefore this feature is not covered in this Guide. 


The option Allow translated tags to be in a different order is, by default, deactivated. However, for some languages or 


documents, it may be necessary to check this option to allow tags to be in a different order. In that case, check that there are 
no problems. 


@® See Chapter 12 — Working with formatted text in the public Help/Guide for information. 


3E Tag Validation Options 26 


OmegatT can also check for programming variables (printf-function 
variables) like '%os'. Please select which behaviour is appropriate. Full 
checking can lead to false positives in normal texts. 


(@) Do not check printf-variables 
~) Check simple printf-variables (e.g., %s, %d) 
~) Check all printf-variables (e.g., %s, %-s) 


| Check simple java MessageFormat patterns (e.g. {0}) 


|__| Allow translated tags to be in a different order 
Warning: Changing tag order may break the translated document! 


|__| Do not allow creating translated documents with tag issues 
Regular expression for custom tags: 


Regular expression for fragments that should be removed from translation: 
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E.7.6. Options submenu — External TMXs Options ae 


OT displays the segments in the translation memories — called external TMX — of your project which have matches 
with the currently open segment in the Editor pane (MT output is displayed in a different window and not mixed with 
human translation). 


In DGT, we have the IT Unit which takes care of these technical aspects. However, you may want to change some of 
these defaults, namely the number of segments displayed in the Fuzzy Matches pane and if you want — in the 
revision/finalizing phase — to see the track-changes in the target segments too. 


You may also want to choose the order of display of Fuzzy Matches: giving more importance to content than to 
formatting, for instance. 


You may also want to configure the Attributes display — as for the Search feature. 


~® Section 0.1. for information on how to configure the Attributes display. 


fan] A 
£4 Your can choose the order in which fuzzy matches are 


displayed in the Fuzzy Matches pane. By default, it is 


displayed by match rate including tags and numbers. But 
you. can select one of the other 2 options. 


PUI TEXT, INCIUGING tags and numbers 


Stemming, no tags and no numbers 
No tags and no numbers 


Full text, including tags and numbers 


2E Extemal IMX Options 


Sor fury matches by: Fl tet ncuding tags and umbers 
Pace select how tags of nor-CmegaT TMs should be displayed 
T)tispley tags) Use YO. for standalone tags (eg, <2 
Wi view af in source) View aff in target 
Watch dspayteate 


|$italeShorteath} ${instialCreationDate} 

Match: <${score}/$incStemScare}/${ediustedicore}4> - Source: <HAttssBeq. Secv.}-G(AttssYear}-@{Txt::Doc, No.}> - Translator: <#{Txz::Transletoz}> 

|(St19}) -> ORT DIFF: Sidifth er | ae ; : 

TM TRA: $[targetText} a hd Match display template which 
defines the way segments are identified in the 
Fuzzy Matches pane. It exists in the public OT 
but has additional variables — some new — 
in DGT-OT. 


mee In the revision phase, you may want — as the reviser — to see the 
track changes in the Fuzzy Matches pane also in the target segments. 
Just click on this option. 


By default, OT displays the first 10 best matches (if any), 
but you can increase that number up to 50. 


You can select here the attributes that will be displayed in ae : axes 

the Fuzzy Matches pane. OT will displays segments with a similarity threshold lower 
>| : oe ; than the one set in the Editing Behaviour menu up to the 
Section 0.1 for detailed information. number of matches selected here. 


| 
kd This may be useful sometimes as it may allow you to 
see matches at subsegment level. 
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E.7.7. Saving and output options menu 


Here you can define the frequency in which OT will make backups of you project memory to the \omegat subfolder. 


The other features are not used in DGT, as far as | know. 


§E Saving and Output Options 


Please select the interval at which the project should be saved automatically, in 
minutes and seconds. 


Minutes: 
Seconds: 


»| External Post-processing Command 
This command will be executed after Creating Translated Documents. 


Template variables: ${projectName} 


Also allow per-project external commands 
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E.8. Help menu 


The Help Menu gives you access to the public OmegaT Guide by chapters. 


™ Take into consideration that DGT version of OmegaT has some changes/adaptation/improvements. 
You have the same information — in pdf format — in the public Guide available in the DGT-OT Wizard. 
The Last Changes section of the public version is not available in DGT-Omegat. 


User's Manual... F1 


About... 

Last Changes... 
| Log. 
y Omegat 3.0 - User's Guide ae 
UW Next 


OmegaT 3.0 - User's Guide 


m 


Vito Smolej 


Abstract 


This document is the official user's guide to Omega’, the free Computer Aided Translation tool. It also contains installation instructions. 


Table of Contents 


1. About Omega - introduction 
41. OmegaT highlights 
2. Summary of chapters 
2. Learn to use Omega in 5 minutes! 
1. Set up a new project 
2. Translate the file 
3. Validate your tags 
4. Generate the translated file 
5. Few more things to remember 
3. Installing and running OmegaT 
1. Windows Users 
2. Linux (Intel) Users 
3. Mac OS X Users 
4. Other Systems 
5. Using Java Web Start 
6. Starting OmegaT from the command line 
7. Building OmegaT From Source 
4. The user interface 
1. Main OmegaT window, other windows and dialogs 
2. OmegaT main window 
3. Other windows 
5. Menu and Keyboard shortcuts 
1. Main Menu 
2. Keyboard shortcuts 
6. Project properties 
1. Properties dialog 
7. File Filters 
1. File filters dialog 
2. Filter options 
3. Edit filter dialog 


9 Amanat Cilae and Caldare 
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— PART F— 
TRANSLATION MEMORIES AND 
MACHINE TRANSLATION 
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F.1. Translation Memories and Machine Translation — a 
close relationship 


The close relationship between Translation Memories and Machine Translation (MT) is an old one. In fact you may not 
be aware that translation memories were — in the 90 — a by-product of machine translation research developed in the 
IBM Lab where the concept of Statistical Machine Translation was, so to speak, born. 


The prevailing approach at that time was Rule-Based Machine Translation, a lengthy and complex process which is 
language-dependent. It was the basis of DGT’s ECMT service that was available for 28 language pairs (10 of which 
prototypes) in the European Commission until 2010. 


In order to be able to apply statistical techniques to Machine Translation, the researchers developed what became a 
success on its own: the Translation Memories ... while Machine Translation went through a limbo phase. 


With publicly available MT services like Google and the release of the open-source Statistical Machine Translation (SMT) 
system — Moses — in 2007 within the Euromatrix research project with co-financing from the European Commission, 
SMT became more widespread — and viable — as it was language-independent and allowed to quickly train language 
pairs with only one requirement: large bilingual corpora ... and that the European Commission — and DGT in particular 
— have plenty! 


The new SMT service of the Commission — MT@EC — managed by DGT started to be tested in 2011 and is now 
available to the EU institutions and also to the Member-States administrations, including universities. 


Machine translation output used in DGT-OmegaT is — of course — the output from MT@EC. 


An interesting feature in OT is that human translation matches from Euramis and machine translation output are never 
mixed up. They are displayed in separate panes. This is particularly useful for language pairs in which MT quality is poor 
— or highly uneven — as it allows having the best of both worlds: not to have MT automatically inserted in the segment 
open for editing ... but to be able to see it in a separate pane and eventually use it just by pressing Ctri+M. 


a | ea 
© Omegal-3.12 + DGT Extensions 2.3-beta update $ = T-RTD-2013-55-56-Gd-2015 =O & 


—— + . Wi i E 

B¥Cm SE AKAVY PSI OSE HTO 

Machine Transtapos Furzy Matches o 
1-MAIN-REFERENCES\NoDG-2009-32009R0723_EN-PT-DWN.tmx |) * 
26/10/09 10:16 

Match: <77/60/70%> - Source: <-2009-32009R0723> - Translator: 


A lei do Estado-Membro de acolhimento no caso de questdes ndo regulamentadas, ou ndo 
percialmente regidas, pelos atos referidos na alinea a). 


Editor = RTD DU1I-800S5-00-01-FT-TRADODOCK - 9 
* |] matters not, or enly-partly not, regulated by acts referred to-in 
the law of the<!0/> Host Member State"! /> in the case of matters not, or partly point (a); 


not, regulated by acts referred In point (a); TM TRA: Pelo direito do Estado em que se encontra a sua 
ssegment 1501 MT > sede social em relacdo as quest6es que ndo sejam reguladas 
Alei do Estado-Membro de acolhimento no caso de questdes ndo requiamentadas, ou pelos actos a que se refere a alinea a), ou que $6 0 sejam 

nao parciaimente regidas, pelos atos referidos na alinea a); parcialmente; 

<end segment> 


Dctonery Comments Glossary Motes  tultiple Translations 


Promct actesaves cn 0831 57/446 (366) 1763, 1604)| [136/141 


Screenshot 49 — Human Translation and Machine Translation — separate panes 


0 
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F.2. Machine Translation a 


In DGT, Machine Translation is pre-processed for all the EN — other languages Word documents — and for some other 
pairs too — and the tmx file is copied to a dedicated project folder (\mt) — which is DGT-specific — when you create a 
project with the DGT-OT Wizard. 


MT@EC provides machine translation between all the EU official languages — 552 language combinations — some in 
direct mode and others via a pivot language. 


If you want to have an inkling of why MT quality can be so uneven — from very good to rotten — see the Annex on What 
Makes Moses Tick taken from the Moses for Mere Mortals Tutorial. 


Unlike MT services publicly available which are trained with every possible parallel data available (namely on the 
Internet), MT@EC engines are trained with EU corpora, i.e. documents from Eur-Lex and the main EU institutions and 
— of course — with DGT corpora of unpublished translations. In short: our translations from the last 2 decades! 


The quality of MT output — and its usefulness for translation purposes — can be quite different not only between 
language pairs, but also from document to document, depending on the corpora the MT engine has been trained with 
and the document(s) to be translated. 


For instance, for very technical documents in new domains (in EU documents), MT@EC may not have any previous 
material on that domain in its corpora and therefore MT output may be poor. But for other domains where there are lots 
of previous translations, MT output will probably be extremely useful ... even for morphologically very rich languages like 
Finnish, for example. 


So the bottom line is that, even if it may happen that MT output is generally not very good for your language pair, 
nevertheless it may give you useful phraseology or terminology “suggestions” as those segments are, in fact, “snippets” 
(strings from 1 up to seven words) from EU translations. 


@ Therefore, don't discard machine translation altogether if you have a first “bad” experience. With a different 
document, the usefulness of MT may be better ... and the separate MT pane in OmegaT enables you to assess 
that in a very easy way. Terminology/phraseology might be useful, even if the general syntax of the phrases is 
rotten! 


By default, MT insertion in the segment open for translation is automatic if there is no Euramis match within the defined 
minimal similarity threshold. But you can easily deactivate it in the Editing Behaviour Options menu and also define the 
minimal similarity for fuzzy matches from Euramis. 


§@ Editing Behaviour Options 


Please select what text you would like to be inserted into the segment that ts not translated 
yet, when you move to it. 


The source text @ Leave the seqment empty 
¥ Insert Machine Translation 
~ Insert the best fuzzy match 
Minimal similarity: 20° 
Prefix: 
Attermpt to convert numbers when inserting a fuzzy match 
/ Allow transiation to be equal to source 
Export the segment to text files 
Go To Next Untransiated Seqment stops when there is at least one alternative transiation 
Allow tag editing 
Validate toga when leaving o segqrent 


¥ Save outo-populated status 


[OK J Cancel || 
* a | 


Screenshot 50 — Defining match rate and automatic insertion of machine translation output 


1 
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Just to have an idea of MT@EC performance in comparative terms, the MT@EC service carries out (automatic) 
evaluations of the MT performance for EU documents. 


Here is the evaluation for the language pairs EN < other EU official languages, which represent the huge majority of 
translations in DGT. 


This is a dynamic ranking that changes over time, but here is the present “evaluation” given in the MT@EC website: 


EN as source EN as target 
language language 


EN as source EN as target 
language language 


= 


Good for understanding The best you can get Good for understanding The best you can get 


Good for understanding Good for understanding Just a rough idea Just a rough idea 


< 


Good for understanding The best you can get Just a rough idea Good for understanding 


= 


Just a rough idea Good for understanding The best you can get The best you can get 


agsieds] 
n 
a Ee 


Good for understanding The best you can get Just a rough idea Just a rough idea 


r 


The best you can get The best you can get Good for understanding Good for understanding 


uU 


Just a rough idea Just a rough idea Just a rough idea The best you can get 


uv 
4 


Just a rough idea Just a rough idea The best you can get The best you can get 


Good for understanding The best you can get Good for understanding The best you can get 


Just a rough idea Just a rough idea Just a rough idea Good for understanding 


n 


- - 


Just a rough idea Just a rough idea Just a rough idea Good for understanding 


< 


Just a rough idea Just a rough idea Good for understanding The best you can get 
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F.3. Translation Memories — General 


OmegaT allows to easily manage translation memories via Windows Explorer and this is a simple — but powerful — 
feature that is very worthwhile to explore. 


In this Section is gathered — and expanded — information concerning translation memories to give you an overview of 
how to make the best use of it. 


In DGT, Euramis is the translation memories database from which retrievals are extracted to be used in CAT tools. 


® Section C.2. for information on Euramis. 


OT allows you to organise external memories giving them priorities or penalties and this feature is optimised with a very 
powerful Search feature. 


OT also gives you complete control over the memory(ies) you want to use for pre-translation purposes before starting 
the translation of a project or during its translation: 


q You can use the OT auto-populate feature to: 


Y Pre-translate your whole project with 100% matches (including tags) in your external memories, by 
simply copying the memory(ies) you want to use for this purpose to the \tm\auto subfolder or to the 
\tmlenforce subfolder. 


q 888 You can also use the brand new DGT-OT feature Search and Pre-Translate to: 


Y Pre-translate a part of your project searching by terms/strings or by regular expressions and have the 
searched segments pre-translated (in batch): 


(§ By automatically copying the selected source segments to the target segments, 
(§ From external memories with 100% or a lower match 
§ From machine translation. 
y  Pre-translate your whole project: 

(§ From external memories with 100% or lower match 
(§ From machine translation 

© Personally, | don’t recommend the use of this feature to pre-translate a whole document/project 

with lower than 100% matches. Although you will have a prefix with the match rate in the target 


segment, you may not notice some differences and therefore have incorrectly translated 
segments. 


To pre-translate the whole project from external memories, use pre-translation (auto-populate) 
from the \tmlauto subfolder (pre-translating only 100% matches including formatting). 
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F.3.1. Fuzzy Matches results — Misalignments 


Many translation memories stored in Euramis are from documents that have been post-aligned using the source and 
target published/released documents (in their native formats, of course). These post-alignments are generated 
automatically and may — or may not be — checked by an assistant or a translator. 


Although the error margin of the present aligner is very low, it nevertheless exists! 


These post-aligned translation memories may come from: 


y 


y 


All Eur-Lex documents: a part of those alignments have been checked by assistants or translators, but 
another part is from automatic alignments without human checking. 


DGT released documents: a part of the documents translated in DGT for which, for various reasons, the 
memories stored in Euramis don’t come directly from CAT tool memories. 


Other EU institutions memories: which have memories available in Euramis and for which also a part of the 
memories don’t come directly from CAT tool memories. 


Fuzzy Motches 
| 


| ENTR-2014-80065-00-01-EN-ORI-00_EN-PT-RET.tmx (+2 more) 14/05/12 12:27 
Match: <100/100° Source: <MOVE-2012-800280002> - Translator: <britoan> 
(1) -> ORI DIFF 


adie tall 


ENTR-2014-80065-00-01-EN-ORI-00_EN-PT-RET.tmx (+2 more) 08/06/11 18:57 
Match: <100/100%> - Source: <-2010-32010D0174> - Translator: <> 
(2) -> ORI DIFF; ill 

TM TRA: Ill, 


Editor - ENTR-2014-80065-00-01-PT-TRA-00.000K 
List of applicable UNECE regulations 


i on vehicle structure integrity 


00 latch 100/100/100%> 


Screenshot 51 — 100% match — in the source segment — but the alignment is incorrect ... or it applies only to 


a specific situation in a specific document! 


The match rate displayed in the Fuzzy Matches pane is only as reliable as the external memories are reliable, i.e., 
correctly aligned. So don’t take it for granted that a 100% match is — in all cases — really that in the target language! 


™ Therefore always check the translation even of 100% matches unless you are sure they are not from automatic 


post-alignments or that, if they are, they have been thoroughly checked. 


|_| 
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F.3.2. Order of Fuzzy Matches display — options @ 


A new option in OmegaT is that you can define the order in which the match segments are displayed in the Fuzzy 
Matches pane by selecting — in the External TMX Options menu — one of the 3 options for Sort fuzzy matches by: 
Full text including tags and numbers, Stemming, no tags and no numbers or No tags and no numbers. 


™ The 3 match rates are not affected. It is only the order in which the fuzzy matches are displayed that will be different. 


Sry mache by: Ful induig as and pubes. 


Stemming, no tags and no numbers 
No tags and no numbers 


Full text, including tags and numbers 


Please select how tags o! 


= 


| | Display tags 


Screenshot 52 — Sort fuzzy matches by in the External TMX Options menu 


The option Full text including tags and numbers is the most “accurate” for our purposes as it displays first the 
segments with higher matches including formatting and numbers. This is the “safest” option and that is why — contrary 
to the public OmegaT — this is the default in DGT-OT. 


The options Stemming, no tags and no numbers and No tags and no numbers give priority to content and may be 
interesting to use in some cases. 
& If you have heavily tagged documents and the translation memories — if they come from Euramis — have no 
formatting at all, this may be an interesting option. 
But it may also be counterproductive on the whole, especially if you use MT output. 
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F.4, Organising and giving priorities to external 
memories 


Omegart flexibility for organising and giving priorities to external memories — to be used for Fuzzy Matches and Search 
purposes — can really help you in your work and it is therefore worthwhile to give it some attention, especially when 
working with large and/or complex projects. 


F.4.1. Organising external memories 


You can have as many reference memories as you want as OmegaT can handle many MB of memories in our service 
computers without noticeably losing speed. 


Therefore, the approach taken in the DGT-OT Wizard is that most of the time you will want all the translation memories 
available in Tradesk imported to your project, so you are not asked — when creating or updating a project — which 
memory files you want to import to your project. They are all imported! 


But if you have a project with some/many documents and which may have new versions — and therefore updates — it 
may be useful to organise your memories — even if you don’t want to give them priorities — as when you update a 
project the DGT-OT Wizard will copy again to your project \tm folder all the memory files available in Tradesk, both the 
ones already in the project and the new ones. In the example below, | created the: 


y \tm\RETRIEVALS subfolder to where | dragged and dropped the retrieval and Celex-titles files (in blue) 
y \tm\REFERENCES subfolder to where | dragged and dropped the reference document files (in green) 


irr] This can save you time if you do an Update of the project with new versions, as you can have the folder display 
ordered by date and easily select, in the \tm folder, the new memories that have been imported (the most recent 
by date) and which you want to keep by dragging them to one of the subfolders — or creating new subfolders — 
and just deleting the files you already have in the subfolders or that you don’t need. 


This way, you don’t have to reorganise the whole \tm folder every time you do an update! 


\tm folder after creating a project with the DGT-OT Wizard \tm folder organised with subfolders 
without priorities 


or : e 3 =e 
800253-00-00-EN-ORI-00_EN-PT-RET.trrnix 
7014-80073 -00-01-EN-ORI-00_EN-PT-RET.trmx 
-2014-80022-00-02-EN-ORI-00_EN-PT-RET.trmx 
~2014-80023-01- -EN-ORI-00_CelextNn-PT-ALtmx 
2014-80023-01 EN-ORI-00 EN-PT-RET.trmx 
~2014-R0027%3-01 -FN-ORI-00_CelexfN-PT-Al 
2014. 80023 .01.01.EN OR!1.00_EN.- PT -RET.trmx 
2014-80023°01°-02-EN-ORI-O0O_EN-PT-RET trix 
-2014-80023-02-00-EN-ORI-00 CelexEN-PT-AL 
2014-80023-02-00-EN-ORI-00_EN-PT-RET.trmx 
2014-80023 02-02 EN ORI. 00_EN.- PT-RET.trmx 
-~-2014-80025-03-00-EN-ORI-00_CelexENn-PT-AL. Unix 
CNECT~2014-80073-03-00-EN-ORI-00_EN-PT-RET.trmx 

-2014-80023 -04-00-EN-ORI-00_CelexGiN-PT-AL.tenx 
-2014-80023-04-00-EN-ORI-00_CN-PT-RET.trmx 


-2014- 


|. RETRIEVALS 


-O1 tmx 


unix 


) CNEFCT-2014-80027701027_EN-PT-DWN trix 
NoDG 2002. 320020D0676_01__EN PT ._OWN1.trnx 
NODG-2002-3200200676_EN-PT-OWN1.t1mx 
NoDG-2006-32006D0771 01 _EN-PT-DWN1.tmx 
NoOG.-2006-3200600771_EN-PT-OWN1Ltms 


NoDG 2010-32010D0267_EN PT DOWN 1L.trmx 
NODG-2012-3201200243_01_ _EN-PT-DOWNL.tmx 
NoDG-2701 27-3201 2002743_EN-PT-DWN1.tmx 

" NobG-2012-520120DC0527_EN-PT-OWN1,trmx 


Screenshot 53 — Organising the memories in the \tm folder by groups/subfolders 


Me 
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F.4.2. Giving priorities to external memories individually 


In OmegaT, you can give priority to memories by individual tmx files — or, most importantly, by folders containing several 
tmx files (“2 Section F.4.3. below) — by ranking them via Windows Explorer. 


This is a feature that can save you a lot of time in complex multi-document projects and much more so as it is combined 
with a powerful Search feature. 


As for Fuzzy Matches and Search purposes, OmegaT “reads” the files in the project \tm subfolder by alphanumerical 
order, you can easily define and change the order of the files, if you want. 


You can simultaneously give the files a descriptive name so that you can concentrate on your translation work and not 
have to think of your reference documents by number. 


You can organise the memories automatically imported to your project when you create it using the DGT-OT Wizard 
before you start translating or at any time during the translation. 


Those memories are stored in the \tm subfolder and can be ordered just by renaming them adding a number before the 
file name and also a descriptive name if you want. 


Example: 1-2012-32012R1268_EN-PT-DWN.tm<x, etc. or 1-Financial-Regulation-32012R1268-EN-PT.tmx, so that the 
name displayed in the Fuzzy Matches and Search panes is meaningful to you. 


You can rename all or only some of the translation memories as in the example below. The numbered tmx files will be 
given priority. 


}. auto 

4) 1-Finantial-Regulation-NoDG-2012-32012R0966_EN-PT-DWN.tmx 
4] 2-FR-Rules of application-NoDG-2012-32012R1268_EN-PT-DWN.tmx 
4 NoDG-2011-32011R0282R_01_EN-PT-DWN.tmx 

4 RTD-2013-80050-00-00-EN-ORI-00_CelexEN-PT-AL tmx 

4 RTD-2013-80050-00-00-EN-ORI-00_EN-PT-RET.tmx 

4 RTD-2013-80050-01-00-EN-ORI-00_CelexEN-PT-AL tmx 

4 RTD-2013-80050-01-00-EN-ORI-00_EN-PT-RET.tmx 

4) RTD-2013-80050-02-00-EN-ORI-00_CelexEN-PT-AL tmx 

4 RTD-2013-80050-02-00-EN-ORI-00_EN-PT-RET.tmx 

4 RTD-2013-80050-03-00-EN-ORI-00_CelexEN-PT-AL tmx 


Screenshot 54 — Ranking individual memories in the \tm subfolder in this case giving priority to some 
memories 
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F.4.3. Giving priorities to external memories by subjects/subfolders 


In OmegaT, you can also give priority to memories by groups/subfolders containing several tmx files by ranking them via 
Windows Explorer. 


This is even better — mainly in complex and large projects — than organising them by individual files as explained in the 
previous section. 


& You can even organise main reference memories of domains you translate frequently — for later reuse — without 
the need to merge memory files ... as after a time you may not remember what reference memories you have 
there. 


This way, you can also update those thematic memory folders by easily adding new memories or deleting old 
ones in Windows Explorer in a few seconds... and always knowing what is there! 


You can organise those memories thematically by subfolders of the \tm folder of the project — to where you can copy or 
drag/drop the tmx files you want — and give those subfolders the name and priority you want. 


In the example below, | created 8 new subfolders in the \tm folder, gave them a priority (number) and a name meaningful 
to me to have the segments from those memories identified in the Fuzzy Matches pane and in the Search window. 


- 
c* 


Name 


l 
NoDG-1994-21994A1223_01_EN-PT-DWN1.tmx 


NoDG-1994-21994A1223_02_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_03_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_04_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_05_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_07_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_08_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223 09_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_10_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_11__EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_12_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_13_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_14_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_15_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_16_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_17_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_18_EN-PT-DWN1.tmx 
NoDG-1994-21994A1223_19_EN-PT-DWN1.tmx 

3) NoDG-1994-21994A1223_20_EN-PT-DWN1.tmx 

3) NoDG-1994-21994A1223_EN-PT-DWN1.tmx 

3) NoDG-1994-31994D0800 EN-PT-DWN1.tmx 
Screenshot 55 — Ranking memories by groups/subfolders. In this example, showing the tmx files in the 

2-GATT-Agreement-1994 subfolder 


You can also just copy/paste folders you have in your computer with thematic translation memories and you can change 
at any time the priority and name given to those subfolders without the need to Reload the project — if it is open — as 
OT will take any changes in consideration when opening new segments or doing new searches. 


a & & & & 


ion 


bed 


0-Eur-Lex-titles 
1-COMBINED-NOMENCLATL 
2-GATT-AGREEMENT-1994 2m> 
3-REG-Protected-names 
4-AGREEMENTS-EUR-LEX 
5-MEM-NORM-Ag-2014 
6-OTHER-REFS 

7-RETRIEVALS-SEL 
8-REF-ELARG-2007-1256-SEF 


auto 


a) 


errr rrrrr er 


W 


af & & & & & sf 
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In the Search window, the segments are displayed by the order in which the (memories or) groups of memories were 
ranked and you can resize and place this window beside the Editor pane if you want. 
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| quatties of the prodidct, om the bate or. exter packeging. ndverselig moteriel or _ ||| smebor. scomas, gu omeeres 
nr imante ralatinn tr that nrntict anc the mertine af the monchart im = ramseinas Dahle i 
Derry Coreeeete  Geerary etre Bettiety Tremwaeteny 
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Screenshot 56 — Search window with results ranked by order of priority given to reference memories folders. In 
this example is also used the tmx2source feature. 


In the Fuzzy Matches pane, the matches will be displayed, first, by order of match rate and, with identical match rate — 


formatting and numbers included by default — by the order of priority you have defined for the reference 
memories/subfolders. 


I 
| |@rapent Gait te io Yew looe Gutum tur 


| BBem Oe Aidiavys 7.70 49 RFrRO 


4 


PRES «Re eR -@c ae -eon 
_Ambiziogas reformas estruturels nos mercacos de prosutes @ Ge trabains, que * | Orrhan segments 1205/15 13:30 =] 

coninbuem pars gunemiar & procdutividede, a ccepetitividade @ o investimeno Match <92/94/91%> « Source: <=> - Transistor: <> 

_ | (1) > ORE DIFF: aretious\momous structure! reforms in procuct sery ce and lebour markets that comtibute to 
increasing productivity, competitiveness and investnent 

| ROPPeerererer reretee ee crate ae ee beets wr garentrieresicane ur tia - co J Th TRA Retormas estrututain ambicioses nos meecacos Ge produtos @ do trabalthe que contrmuans 

investment Pian for Europe. * | para um suments Ga procuticade, da competi dace @ do investmento 

| Tal @age a @bwnagdo dos cbstaculos a0 financiaments @ o langamento de 

projetos Ce investments, bem como ume rapida implementag So do Plans o 

Serestwrerea terse Europe. nee ” 2-REFS.2019\N0DG.2014.52014DC0902_EN-PT-OWNt tmx (+3 more) OS/01/15 15:53 

Match: <6.2/55/59%> « Source: <-2014-52014DC0902> « Transtator: <> 

| Transtation fest modified by machemne on 12-May-2015 et 18.35)35 (2) > ORI DIFF: Arnbitious np iermontetiern of structural reforms in product, serveosseryvice and 

“Ambitious structural reforms in prosiuct, nencice end iabour maskets that labour markets centhat contribute to increasing productivity, regaining Competitiveness and 
_sontribute te increasing pieductivity, compatitivanoss snd investment Pecan ait ae nnn tr Ret cla one abe heke 
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rk sarneniphanontaninle ore of prosperity. lad (3) > ORI HEF: Howeverambr oun nec ALTeCSeRA NES nienpesoeeaeit con pee peeneet 

AS promover @ erlagdo de empregs @ © crescimerte, 3325 reformes Gnd reore could Se Gone. aur Ma suite twat oc 2 10 taekiey Grammin, LAB ibs Ved ty Compe liye 
|[contribuirho pare une partion mas aenpia de prospericiade and ear ent 

Ti TRA Todavia, venitice-ce a perssténcis Ge problemas estrutura’s no mercado de wabaino @ 540 

Reforms in the functioning of financial markets will support a Gurable ieee ideale en corde ipecktcay uorearce 

rebalancing in the economy, eese access to finance for Investment and ae 

Geteveraging 

=" tn the banning. prtvete and D-BUROPEAN-SEMESTER-201METO1ISCOSTS-EN-PT-AL tome 17/01/14 19 38 

As raformes no funclonaenanto dos mercados fnanceiros apotaro unm Match <4235/33%> - Source: <-2013-520 13SCO379> - Transistor, <> 
‘reequilibrio duradoure da economia, facitarSo o oceseo ao fnanciamento ce (4) > ORI CFF: Structure amoricws atiuctural reforms in product. service and seeviceiabour markets [het 
ivestrrentes @ atenuarSo © impacto Negatvo oe redugao do nivel de cocinibule co creasing productivity. competiiveness anc invesiment 
“encivisaments nos setores Sencano, privedo @ puBuco a TH TRA Retommas esirutura’s nos mercados de produtos @ sernijos Zs 
Cnty Commmeete Groner thitete Tense Mowe 


(iaasie rire Sea aa 
Screenshot 57 — Fuzzy Matches pane with matches displayed in the order of priority given to \tm subfolders 
with the display option Full text including tags and numbers. 
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Furthermore, in the Fuzzy Matches pane you also have information about the number of occurrences of a particular 
segment in the external translation memories — and also in the project memory — by right-clicking on the mouse. 


You can also insert or replace the text in the segment open in the Editor by the text in the Fuzzy Matches pane by 
selecting the desired option in this menu. 


re oC — 
7s 3a2 49 8 O 


<i><Orphan segments (+9 more}> <23/01/14 15:09> ¢ 
Match: <100/96%> - Source: <-> - Translator: <></i> 
(1) > TM ORI: the confidential information Is subsequently communicated to the recipient without any obligation of confidence by a third party 
who Is in lawful possession thereof and under no obligation of confidentiality; 

ORI DIF: the <t0/>confidential information <t1/>is subsequently communicated to the recipient without any obligation of confidence by a 
third party who is in lawful possession thereof and under no obligation of confidentiality; 

TM TRA: as informacées confidenciais forem subsequentemente comunicadas ao destinatario sem qualquer obrigacaéo de 
confidencialidade por um terceiro que esteja na posse legal das mesmas e que nao esteja vinculado pela obrigacdo de confidencialidade; 


Insert Match Into Translation 
Replace Tracutation with Match 


Screenshot 58 — List of occurrences in the project with the same segment (in this case 9 more) and options to 
insert or replace translation in Editor. 
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F.5. Giving a penalty to external memories 


Sometimes, it is useful to distinguish between reliable translation memories and less reliable ones. 


Machine Translation doesn’t need to have a penalty because it is stored in a separate folder (\mt) and is only displayed 
in its specific pane, therefore it is never mingled with human translations from Euramis, either in the Fuzzy Matches 
pane or in the Search window. 


Furthermore, OmegaT accepts translation memories with the same source language and a different target language 
from the one of your project, giving give you results in the Fuzzy Matches pane and in the Search window for those 
other target language(s) too... if there is a (100% or lower) match in the source language, of course. 


& If you want to see those segments displayed as a second source language in the Editor — but in that case only 
when there is a 100% match (including tags) — you can use the tmx2source option. “ Section F.6. below. 


So you can have reference memories with a different target language to use as reference for terminology purposes, but 
you may want those segments to be displayed after the project language pair matches. 


In that case, you can give a penalty to one or more of those memories by creating a subfolder (or more than one 
subfolder) with names like "penalty-xxx" where xxx is a number from 0 to 100. 


A percentage penalty corresponding to the numbers will be applied to the matches of all translation memory files within 
such subfolder. For instance, a 100% match will become a 70% match with the example below. 


The 3 percentages (with and without tags) are affected equally so. For example, if the matches were 75/85/90, they will 
become 45/55/60 as the penalty is applied. 


& As OT will display matches in the Fuzzy Matches pane up to the number of matches defined in the Options — 
External TMXs options menu (default: 10) — and even if those matches are below the minimal similarity 
threshold defined the Editing Behaviour menu — you can choose to give a penalty lower than the minimal 
similarity threshold and even so have those matches displayed in the Fuzzy Matches pane. 


Memory subfolders in 


the \tm folder BOSCH SE AKAVY( 59049 8S2B0 
roe ee -¢0 jae -&0)) 
7 PRISE equipement for utlinade para ins profigsionais ¢ clo prefissicnais, co escala local para “ 2-Reference-CNECT.2014-900250100_EN-PT-OWA tmx {+3 mora] 1084 19:44 
Name seacnctincalhuecnttion And Bolts ~ Match: <100MOD100%> . Source: <CNECT-2044-2002301 00> - Translator: <machame> 
ies oR ah nT aw Te 2004 =o {1} > ORI CHF: PMSE oquipmont is used for professional and non-professional purposes, 
social ¢ ds ncvatie Ge entreleninerts ne Unido * from focal to Union-wide events. 
h la-NormMem TU TRA: Os equipamentos PUSE so utiizedos pers fins profissionais © 
They inchade brosdcasting, cabtural, musical end theatricel performances, and social ndo-pevfissionsis, em eventos desde 2 escals local até & escale de Unis, 
}. 1-REFERENCES pemvboe-taheasyy 
inclvers a radiodedo. representa; des cufurels. rusicgls ¢ leetre’s ¢ eventos sovigs ¢ 
}. 2-RETRIEVALS oesoornios persty Sime CNECT-2015B00COOEN-ORC-LO_ENFRAL tru SBOE 16:17 
Match. <TOTOTON> - Soce <ONECT-214-82INNS - Trarsigtor. > 
h auto Tracsiation last moditec ty mactarte ot (fun2074 at 140225 (2}-> ORI DIFF. PMSE equipment s usec for professions! and ronprolessione! pupeses, fon 


PUSE equipment is used for professional end non-professional purposes, from localto ff bcs bb Unon-nece eves 
h penalty-30 Unionwide events. 7M TRA Les Goupements PISE sont ulllses 4 Ces fins profesnocneties ef non 
Segment 071 “TRA” > professornehes. pour des evéremerts oceus comme 6 fécheie de [nion 


Screenshot 59 — Display in the Fuzzy Matches pane of matches from segments with a 30% penalty applied, in 
this case the French translation of this same document. 
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F.6. Translating with the help of a relay language @ 


This is anew Omegal feature which may be very useful if you have a document to translate from a language you don’t 
master (very) well and for which there is already a translation into a language that you master. In this case, you would 
probably like to have the possibility of seeing both languages as source languages. 


™ Of course that, if you don’t know the original source language at all, there is no point in seeing it and therefore 
just create your project having as source language the translation into the relay language. 


In that case, you may have to request retrievals and MT as it may not be pre-processed for that language 
combination. 


To have the relay language displayed in the Editor as a source segment, you can now use the tmx2source feature to 
use a (reference) external translation memory with the source language identical to the source language of your project 
and the target language of the reference memory different from the target language of your project, displaying the relay 
language segment in the Editor below the original source segment. 


Example: You have an LT document to translate into PT and your knowledge of LT is not perfect (or far from it)... but 
there is already an EN translation of the same document. In this case you can create the LT-PT project and use the 
LT-EN external translation memory to have the EN relay language displayed in the Editor below the LT original segment. 


This way you will be able to save time as you will see both the LT and EN as original segments. 


update 5 «: 1-PP-2015-00248 of XX. 


\tm subfolder: nner 


b auto \BBSCMSEACAVI <Q KO MS PRO 
J. tmx2source Machine Translation -60 


“ 


Mais de 85 por cento do comérclo eletronico clientes afirmaram que 0 fator mais importante é o custo de 


\tm\tmx2source transporte, 
subfolder: 
Edhor ~ FP-2015-00248-00-00-FT-TRA 0) doc = - we 
4) EN_GB.tmx 


Daugiau kaip 85 proc. e. prekybos kKlienty saké, kad perkant internetu svarbiausias veiksnys yra 
pirkiniy siuntimo kaina. 

EN-GB: More than 85% of e-shoppers say delivery price is the most important factor when buying online 
<segment 0024 MT > 

Mais de 85 por cento do comercio eletronico clientes afirmaram que 0 fator mais importante ¢ o custo de 
transporte, 


<end segment> 


ee 
> 4 


Oxtionary Comments Fuzzy Maiches Notes Glossary Multiple Translations 


‘Proget autesaved 02 11:00 12/57 (12/57, 57} [120/115 | 


A 


—— 


Screenshot 60 — Translating a LT document into PT with the help of a relay language (EN). The EN translation is 
displayed in the Editor below the original segment as a second source language identified with the relevant 
language code. 


eB 
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You can also use this feature just to have the translation into another language displayed for terminology purposes, as in 
the example below in which — for an EN-PT project — an EN-FR alignment of the same document was used. 


Ge You can also associate the penalty and the tmx2source features as in the example below as in the Editor will only 
be displayed the 100% matches (from the \tmx2source subfolder), while in the Fuzzy Matches pane you can have 
also below 100% matches from the \tm other (sub)folder.. 


This is important because, as translation memories from Euramis are cleaned of formatting, if your document is 
(heavily) formatted and you are using an extraction or alignment from Euramis, you won't see many segments 
displayed in the Editor as a second source language because the match is below 100% due to formatting! 


\tm subfolder: Editor ~ CHECT 2014-0002) 01 Ol I"'T-TRA- 00.00% 
Translation last modified by machame on 20-Jun-2014 at 09:34:12 - 
Wired and/or wireless PMSE systems are used in a wide variety of events, from local 

and non-professional to EU-wide touring events. 

FR-FR: Les systémes PMSE cablés et/ou sans fil sont utilisés 4 l'occasion d'événements trés 
divers, qui vont des manifestations tocales d'amateurs aux tournées dans toute I'UE. 


Frere 


\tm\tmx2source <segment 0211 “TRA™ > 

subfolder: Os sistemas PMSE com e/ou sem fios sdo utilizados numa grande variedade de eventos. desde 
eventos locais e nao profissionais ate eventos em digressao a nivel da UE. 
<end segment> 


"| FRLFR.tmx 


Screenshot 61 — Translating an EN document into PT using the FR translation as reference displayed in the 
Editor as a second source language 


The \tmx2source subfolder is not automatically created when the OmegaT project is created (either via the DGT-OT 
Wizard or directly in OT), so you will have to create it manually and copy to it the file renamed with the language code of 
the reference memory target language in the format shown below. 


Just copy the alignment and rename it with the target language code. In the 2 examples above, the memories were 
renamed LT_LT and FR_FR. 


You can see a list of the language codes either in OT — under Help — or in the Project — Properties menu — 
Source File Language. 


GO Ean Project 


Et propect propertios here 


hegrmentaten 


Fae Fuhorn 


o~ Carvcet 


Screenshot 62 — Project — Properties menu — Source File Language 


You can add this subfolder at any time during the translation of your project and you can also delete it — or move it to 
another location at any time — without the need to Reload the project. 


oe 
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F.7. Pre-translate (auto-populate) the whole project 


The pre-translation feature may be useful if you find a particularly reliable document you want/have to use for 
pre-translation purposes, for instance, if you are translating a legislative act that modifies or repeals a previous one and 
you want to change the previous translation as little as possible in your new document. 


You can pre-translate your whole project using one or more reference memories by copying it/them to the \tml\auto 
subfolder of your project. 


The segments in the memory file(s) copied to the \tmlauto subfolder which are 100% matches (including tags) will be 
automatically transferred to the project memory — an operation which is in OT is called auto-populate the project 
memory. 


Anew OT feature is that when you open your project (or do Reload), those segments are displayed in the Editor, by 
default, highlighted with an orange colour to call your attention to the fact that they were pre-translated. They 
will remain so highlighted unless you modify those segments. 


The translator's login from the reference document used for pre-translation (in the example below costami) will remain 
unchanged if you do not open and modify that particular segment. 


If you have the option View — Modification Info — Display for Current Segment active in Omega’, the identification 
of the pre-translated segments will be displayed with the login of the translator — which may be yours or somebody 
else’s. 


Editor - RTD-2013-80050-00-00-PT-TRA-00.DOCX oO 


Translation last modified by costami on 20-Nov-2013 at 13:21:46 
THE EUROPEAN COMMISSION 

<segment 0004 **TRA*™* > 

A COMISSAO EUROPEIA, 

<end segment> 


Having regard to the Treaty on the Functioning of the European Union, 
Tendo em conta o Tratado sobre o Funcionamento da Unido Europeia, 


Having regard to Council Regulation (EC) No 723/2009 of 25 June 2009 on the Community legal framework for a 
European Research Infrastructure Consortium (ERIC) (<t0/>), and in particular point (a) of Article 6(1) thereof, 


Whereas: 
Considerando o seguinte: 


Screenshot 63 — New OT feature: By default auto-populated (pre-translated) segments are highlighted with an 
orange background. 


This feature is very useful if you have a previous version of a document that has already been released and you want to 
use its memory for pre-translation in a newly created project. However, if you still have the previous project in your 
computer — and you finished your translation using OT — you can just update the old project in the usual manner with 
the new version without the need to create a new project. 


™ If it is an ongoing project, there is no need to do pre-translation at all if you have new versions of documents. The 
project memory stored in the \omegat subfolder contains all the segments you translated in that project and they 
are automatically displayed in the Editor when you open that project after updating it via the DGT-OT Wizard. 


oe 
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F.7.1. Pre-translating before starting — or during — the translation 
of a project 


If you want to pre-translate before starting the translation of a new project, it is just a matter of copying the reference 
document(s) you want to use for pre-translation to the \tm\auto subfolder of your project and open the project. 


All the 100% segments (including tags) in that memory will be automatically transferred to the memory of your project 
and displayed in your document(s) in the Editor pane highlighted in a | colour (by default). 


If you want to use this feature after having started the translation of the project, do the same: just copy the memory to 
the \tm\auto subfolder of your project and open the project or do Reload. But in this case, the 100% match segments in 
the memories in that subfolder will be automatically inserted in your document(s) only in segments that you have not 
translated yet and which, therefore, have the untranslated status and are not in your project memory. 


As your project memory has precedence, the segments you had already translated remain untouched and the 100% 
matches (formatting included) from the external memory will be displayed — first according to match rate — in the Fuzzy 
Matches pane. 


This is important if you find a reference memory you want to use for pre-translation purposes after starting the translation 
of a project. Your translation will not be overridden by the segments in the \tmlauto files. 


You can leave the memory there as — contrary to what happens to the files in the tm\enforce subfolder — the 100% 
segments in the external memory in the \tmlauto subfolder do not override the project memory. 


This may even be useful if you receive new versions of one or more documents in the project, as the external memory in 
the \tm\auto subfolder will continue to pre-translate the untranslated segments, if any, in the new document/version that 
have 100% match in the memories in the \tmlauto folder. 


F.7.2. Giving priority to memories for pre-translation 


OmegaT reads the files in the \tm\auto subfolder by alphanumeric order. So, if more than one memory is copied to the 
tmlauto subfolder and there are several identical source segments with different translations, OmegaT will read all the 
files by alphanumeric order and transfer to the project memory the translation of the last occurrence in the last memory 
in the tm\auto subfolder. 


The other (conflicting) translations will be ignored and will be displayed in the Fuzzy Matches pane when that particular 
segment is opened. 


lf there are (many) segments with alternative translations, OT will take the previous and next segments of each 
alternative translation into account and transfer to the project memory the last occurrence in the last memory 
(alphanumeric order) in the tm\auto subfolder which has the same previous and next segments. 


Keep in mind that, in DGT-Omegarf, “alternative translations" are document-independent by default, i.e. OT accepts that 
status for non-unique segments in all the documents of the project without “looking” at the number of the document 
(which is recorded) and only “looking” at the previous and next segments. 


The reason for this default in DGT-Omegat is that frequently there are new versions of documents arriving during the 
translation or revision process and alternative translations from a previous version of a same document would not be 
taken into account if the “alternative translations” were document-dependent as the number of the new version would be 
different. 


If you want to make it document dependent, just go to the Options — File Filters menu and unclick the option Ignore 
File Context when identifying segments with alternative translations. 


& If this feature is important for your project, check that this option is defined as you wish! 
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To give priority to one or more memories, it is just a matter of renaming the memories in the tmlauto subfolder by 
(re)numbering them in the inverse order of priority so that, in case of different translations for an identical (non-unique) 
segment in the project, the translated segment transferred to the project memory comes from the tmlauto memory to 
which you give priority. 


Example: If you have memories A, B and C, to give priority to the segments coming from memory B in case of identical 
source segments, just rename the memories giving that particular one the last number: 

Y 1-MEMORY-A; 2-MEMORY-C; 3-MEMORY-B 
and the translated segments from Memories A and C will be ignored and not transferred to the project memory. They will 
be displayed in the Fuzzy Matches pane like any other match, before segments in other memories in the \tm folder (if 
any). 


™ This is the opposite of what is done when organising external translation memories to be displayed in the Fuzzy 
Matches pane or to be used with Search! 
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F.8. Search and Pre-Translate me 


This is a completely new feature specific to DGT-OT which allows you to search by some criteria and pre-translate the 
searched segments, either copying source to target or pre-translating — from the external memories or from machine 
translation output — the selected segments. 


You can select the Expression mode and the Word mode, pre-translate only untranslated segments or also already 
translated segment (if you are already in the middle of your translation), memorize terms/strings/regular expressions and 
filter those entries by author or translator and date. 


You can also configure the display of attributes. “> Part O for information on how to do it. 


® Section G.1. for an explanation of the effect of the Expression and Word modes and the other options, as it works 
the same way as for Search Project. 


RESOVO METI COmmm a SIO Ne Poe BL) lee mtTien orem fe 2) -> GPS OPP: UBT OP TAPE TRIN. TES 
. 


CORRE HES PO8ls CF Kinew® Pare Orttes MOaNleN €8 LE aD n 
1 MENCOES f TEPC DE GUALIOADE 


Anema (ArtOne Ba) os -@c 
Areru h (oreye DS _ 


Cetertinen of "hety beet” 
nn een 


2 amy heats 


1) CRS toons 2) 
Aracs ® uartige 28° 


Keenve tall eamemecinne Rew EL) aur tessttiaal peeeher te 
Con 


Renebee pertels Ge Kewere pare p-edetes oy iciee de UE wee ee 


Screenshot 64 — The Search and Pre-translate window 


™ Please note that in case you have more than one match with the best score, this screen provides no way to decide 
which one will be inserted as it depends on memory ordering. 
® see Section F.4.3. for more details. 
Don't forget to check in the screen what will be inserted before confirming! 


F.8.1. Translate as source B® 


This feature can be very useful when you have documents with hundreds or thousands of segments only with numbers 
that you may want to “translate” in a batch so that they are counted as translated in the statistics and OT doesn’t open 
them with the command Next untranslated segment as they were previously translated in a batch operation. 


This was in fact the reason why this new feature was first designed: to allow you to “get rid” of segments, for instance, 
only with numbers. 


In the example below, there were almost 5,700 segments (out of 18,000 segments in the project) which were numbers 
(headings of the Combined Nomenclature). 


With this new feature, in a few seconds, all those segments were pre-translated (by copying the source to the target 
segments) using the regular expression “[“a-zA-Z]*$b 


sr 
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Screenshot 65 — Search and Pre-translate of full segments only with numbers 


F.8.2. Translate as match B&@ 


You can translate a project from the memories in the \tm folder defining the minimum match rate — Match minimal score 
— for pre-translation purposes and searching for exact/keyword or regular expressions. 


This may be useful if you have segments with similar text in which — for instance — only the number changes, like 
“article 1” etc., Section 1 etc. Or if you want to pre-translate segments with a certain term or expression. 
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On Rarmonised technical conditions of radio spectrum use by wireless audio programme making and special events equipment in the Union 
[100 %Jreletiva 4s condigoes técnices harmonizadas de utilizegao do espetro radioelétrico por equipamentos audio sem fios na realizacao de 
Programas © ovontes especiais na Uniso 


55> 

This decision sire to harmonise the technical conditions for the availability and efficient use of radio spectrurn for wireless audio equipment used for 
Programme making and special events (“PMSE*) 
[100 %JA presente decisdo visa harmonizar as condigdes técnicas relativas 4 disponibilidade e utilizagéo eficiente do espetro radicelétrico para 
equipamentos 4ucdio sem fos utilizados na realtzagao de programas ¢ eventos especiais («PMSEs). 


58> 
(1) ‘wireless audio PMSE equipment means radio equipment used for transmission of analogue or digital audio signais between a linited number of 

\| transmitters and receivers, such as radio microphones, in-ear monitor systems or audio Sinks, used mainty for the production of broadcast programmes or 
private or public social or cultural events: 

|| (100 %)«equipamento PMSE Aucio sem fos»: o equipamento de radiocomunicagoes utilizado para transmissao de sinais audio analégicos ou 
digitais entre um mdmerc limitade de emissores e recetores como, por exemple, radiomicrofones, disposttivos intra-auriculares de monitorizagao 


ou ligagoes audio, utilizado principalmente para a produgao de programas de radiodtfusao ou eventos socials ou culturats publicos ou privados; - 
Chee | Tremadate wtermctive | Traendate all = 


Screenshot 66 — Pre-translate with Match from the external memories for segments with a certain term. 
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You can search by regular expression and DGT-OT will display the results — which you can check — of the segments 
filling those criteria and of the translation that will be transferred to your project memory. 


Those segments will be displayed in the Editor with a prefix indicating the match rate (100% or lower) between brackets 
to call your attention to the fact that those segments were pre-translated and to their match rate. 


For OT, those segments are considered translated and counted as such in the statistics. 


When you have verified/changed all those segments, you can do a Search/Replace to eliminate that prefix in a batch 
operation, unless you have done it already segment by segment. 
You can also pre-translate the whole project, just by using — in the Search for field — the simple Regular Expression 


In that case, in the Search and Pre-translate window will be displayed the list of all the segments in the project. 


™ If you project is really big, this may take some time. 

& Personally, | don’t recommend using this feature for the whole project unless, for instance, the documents in 
your project are heavily tagged and with a pre-translate below 100% you may get 100% matches in terms of 
content. 


F.8.3. Translate as machine translation &@ 


The same applies to pre-translate using machine translation output. In this case, all the segments fulfilling your search 
criteria will be pre-translated — if you have the relevant MT tmx files, of course — and the segments in the Editor will be 
displayed with the prefix [MT], 
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[MT] Relative 4 harmonizacéo das condicées téecnicas de utilizacao do espetro de radiofrequéncias por audio sem fios, servicos de realizacio de 
| programas ¢ eventos especiais de equipamentos na Unito 


55> 

This decision alms to harmonise the technical conditions for the availability and efficient use of radio spectrum for wireless audio equipment used for 
programme making and special events ("PMSE") 

 (MTJA presente decisao tom por objetivo harmonizar as condigoes técnicas para a disponibilidade o a utilizagcao oficionte das radiofrequéncias 
"para equipamentos de audio sem fics uttlizados para o8 servicos de realizacao de programas @ eventos especiais (PMSE}, 


| 
\ 
i, 
| 
| 
| 
h 
58- 
|] (2) Wiretens audio PMSE equipment means racic equipment used for tranamiasion of anniogue oF digital audio signats between a limaad number of 


transmitters and receivers, such as radio microphones, in-ear monitor systems or audio links, used mainly for the production of broadcast programmes or 
private or public social of cuftural events; 

(MT](1) «PMSE éudio sem fios de equipamentoss, os equipamentos de radio utilizados para transmissao de sinais audio analogico ou digital entre 
um ndmero limitado de transmissores © recetores de radio, como, por exemple, microfones, oquipamentos intra-auriculares doe monitorizacao do 
canals ou ligagées audio, utilizadas principalmente para a producao de programas de teledifus4o integrals ou social publico ou privado ou eventos 
cuiturais; 


oe Transtate oneracmwe lances at 
Qe ee re ee ert terre re 


Screenshot 67 — Pre-translate with Machine Translation for segments with the same term as in the previous 
screenshot. 


™ Take into consideration that the prefix is really part of what is inserted in the target segments: if you do not delete it 
yourself, it will also appear in the project memory and in all generated files, including the target document! 


& The most practical way is to do a Search/Replace to delete them in a batch operation after checking them all. 


8 
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F.9. Giving absolute priority to an external memory — 
Enforce @ 


This is a new feature that can be very useful — if handled with care — when the translation of a project is split among 
several translators and there is a high rate of non-unique segments (repetitions). 


Even working in real-time share mode with TeamBase, it is possible that inconsistencies crop up among documents (or 
parts of documents) translated by different translators and/or that are at different stages in the translation or revision 
process. 


Therefore, the reviser — or the (lead) translator — in a shared project may manage these inconsistencies by using a 
global project with all the parts/documents together and select the memory(ies) that have precedence and even override 
the project memory in the global project. 


This feature may also be useful — even in a project with only 1 translator and 1 reviser — if the revision is done 
progressively (in “cascade”). In that case, the translator may want to override the translation in non-revised documents in 
his/her project with the memories from documents already revised. 


The \enforce subfolder is not automatically created when the OmegaT project is created (either via the DGT-OT Wizard 
or directly in OT). You have to create it manually in Windows Explorer if you want to use this feature. 


You just have to copy to the \tmlenforce folder you create the memory you want to use for this purpose and open the 
project or Reload it. 


The 100% segments (including tags) in the files in this subfolder are the only ones that have priority over the project 
memory and — unlike the files in the \tmlauto subfolder — will even replace any segments already translated (saved in 
the project_save.tmx). Furthermore, that memory will continue to override any changes you make to the segments in 
your project afterwards. 


When you open your project (or do Reload), those segments are displayed in the Editor, by default, also highlighted 
with an orange 1 colour — as in the pre-translation using the \tm\auto subfolder — since they are considered 
“auto-populated” from an external memory. They will also be identified with the login of the translator of those segments 
in the external memory. 


They will remain so highlighted and with the other login unless you modify them therefore making them your own. 


After opening, or reloading, the project — as the segments are immediately transferred to the project memory — delete 
that folder or transfer that memory to the main \tm folder as otherwise the segments in that subfolder will keep overriding 
the segments you change afterwards and which have a 100% match (tags included) in this memory. 


™ So be extremely careful and don’t forget to delete the \tm\enforce subfolder if you don’t want to see your 
subsequent changes continuously overridden. 
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F.10. Sharing external translation memories on a server 


If you have a package that is being translated by several translators, besides working in share mode via TeamBase to send 
and receive — in real-time — the segments translated in that particular project, you may also want to share the external 
memories to be used by all the translators involved in a particular project. 


™ If you want to use this in a project while teleworking, check that there are no connection/speed problems. 
Usually it will be the lead translator/coordinator in your Unit that will organise the material for a given project. 


The big advantage is that external memories are organised for everybody and can be changed for all translators involved at 
the same time. 


The disadvantage may be that each translator cannot organise the external memories differently — for instance depending 
on the stage in his/her translation — as those changes will be done centrally and therefore will be used by all the other 
translators. 


= Please also note that memories are fully loaded in the computer memory (RAM): do not use this feature to create a 
. . ? . . p . . . 
very big directory to be shared by several projects, since no computer is yet capable of managing gigabytes of 
translation memories for one single project! 


& As you can at any time change the location of the external memories, you can easily copy the \tm folder organised 
by the coordinator to your computer and organise them differently as you prefer. 
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Screenshot 68 — Changing the location of the external translation memory folder to a server location. 


To change the location of the external memories for your project: 
1— _ InDGT-OT, select Project — Properties menu. 


2— In the field Translation Memory Folder, click on Browse to select the folder in the server where the organised 
external translation memories are stored. 


3— _ Click on OK and you will start receiving fuzzy matches from that “central” location. 


You can change the location again as many times as you want. OT will remember this setting every time you reopen 


this project. 


— PARTG— 
DGT-OT SEARCH FEATURES 
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G.1. Search Project ae 


The public OmegaT Search is very sophisticated with options very important to translators as they can save a lot of time, 
especially for large projects with a large number of reference memories. 


In DGT-OT, the Search window has been redesigned and further improved with features that really make a translators’ life 
easier! 


& Such a wealth of options may be confusing at first, but — believe me — once you understand how it works you won't 
be able to live without it ! 


G.1.1. General features 


In the Search window, you can have all the search options displayed — including the Advanced Options (highlighted in 
blue below) — or just the ones more frequently used, thereby having more space in the window for the display of the 
Search results. 


You can also configure the attributes that identify the results in the Search window. 


®) Part O for detailed information on the customization of Attributes. 


at Seatthh 

| Text to search 
|) i inseerce NOT v | Memorize 

¥ Inansiton || HOT ¥) Memorize 
Pinas Poa | Menorae. | | 


Same tet in al fields AND 9 OR 


ea | oh 
Bact search «= Keyword search «= @ Regular expressions Case aa Q Partial segment «| Whole words «Full segment 
Search options 

| ¥ Remove duplicates ~ Trenslated | Untranslated 0) Translated or untrensiated 


y Search scope 


| Source files | Memory ¥oTMs — Glossaries File or folder name: > | Memorize 


Screenshot 69 — Search window with Advanced Options displayed (highlighted in blue above). 


As you can see from the screenshot above, you can search — by exact search, keyword search and regular 
expressions, strings, whole words and lemmas; use the Booleans OR, AND and NOT; search by regular expressions, by 
author/translator, by date — in source or/and target segments in your project (translated and/or untranslated) and/or in 
the translation memories and/or in glossaries and notes, by date and by author or translator. 


You can also limit the search to a subfolder or a translation memory in the \tm folder. 
If you have given priorities to memories or subfolders with memories, you will have the search results displayed by the 


(alphanumeric) order of priority you have set. 
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For a number of settings, you can also memorize searches to be reused in that session, in that particular project or in all 
the projects. This option is particularly useful for regular expressions. 


® in Section G.5. the list of the Regular Expressions already memorized in the Search window for searches In 
Source. You can copy them to other search fields in the Search, Search/Replace, Search Directory and Search and 
Translate windows as the memorize feature is field-dependent. 


In the same window, you can also filter, by those criteria, segments in your project to have them displayed in the Editor 
pane for editing, by clicking on Filter at the bottom of the Search window. 


To do a search, just highlight in the Editor the term/string you want to search and press the relevant icon or Ctrl+F or 
select Search Project in the Edit Menu. 


The Search Project window is displayed, already with the term/string to be search inserted. Just press Return to accept 
the selection or select other options. 


You can also not highlight any term/string and just write the term/expression you want in the field you want. By default, 
OT assumes that a search is made in the In source field. For searches in the In translation or In notes fields, you have 
to copy/paste or write the term/string. 


OT will remember the settings you used in your previous search and will display them in the Search window therefore 
(maybe) saving you time. 


G.1.2. Text to search BE@ 


You can search terms/expressions In source and In translation. 


Anew OT feature is that you can also search In notes. With the search in notes, OT will display the list of segments 
which have notes with that particular string and you can click on the number at the left of each segment and OT will open 
that segment and you can see the note in the Notes pane 


You can also have the notes displayed together with the matches in the Fuzzy Matches pane. “@® Part O on Attributes 
customization. 


A DGT-OT specific feature is that you can also search not only in the source or in the translation, but also search in the 
source and in the translation segments of your project or of your translation memories using the Boolean AND. With this 
option you can therefore search a particular term or expression in the source language and associate it with a term or 
expression in the target language to search segments which have both. 
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Screenshot 70 — Search using the Boolean OR and the Boolean AND 


® 


DGT-OMEGAT, ITS WIZARD WIZARD AND DGT’S CAT ENVIRONMENT — A TRANSLATOR’S GUIDE — MJM — June 2015 


A new feature in DGT-OT is that you can also use the Boolean NOT to exclude terms/strings. This feature can be very 
useful to limit the search if you want to check the terminology/phraseology variants in your external memories. 


So, with the Booleans AND+NOT, you can search a particular term or expression in the source language and associate 
it with a term or expression in the target language to search segments excluding one or the other in order to find variants 


either in the source or in the target language. 


Search of “users” In Source 


Search of “users” In source with the term “utilizadores” In 
Translation and the Boolean AND+NOT 
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Screenshot 71 — Search using the Booleans AND+NOT 


G.1.3. Expression mode and Word mode # 


In DGT, the Search feature has been redesigned and improved and you have — besides the Expression mode options 
— the Word mode options which you can associate to widen or limit the search results depending on your needs. 


The options available are: 
y Expression mode: Exact search, Keyword search, Regular expressions and Case sensitive 
yY Word mode: Strings, Whole words and Lemmas. 


The Expression mode describes how the succession of words (Separated by spaces) is considered. The possibilities 
are: 


y Exact search: the words must all be in the text and in same order, i.e. you look exclusively for the entire 
query string; 


Keywords: one of the words must be present, so "one example" means indeed "one OR example”. In other 
words, you look for all the words in the query independently of the order in which they appear in the 
segments being searched; 


y 


Regular expression: what is in the fields is not words but a regular expression. “@ Section G.5. for 
information. 


y 
The Exact and the Keyword search types support the wildcard characters *' and '?'. The ™' character matches zero or 


more characters (the search term 'run* would match ‘run’, ‘runs’, and ‘running’, for example). The '?' character matches 
exactly one character (‘run?' would match ‘runs’ and ‘rung’, for example, but not ‘run’ or ‘running’). 


The Case sensitive option is obvious: you will only have results in which the case matches your search. 


15 


TM:1-ROFECRENCESINODG-2006-3200600771_01__EN-PT-DWN'1t tmx: * 
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For the Exact Search and Keywords options, you can then decide how each word will be considered individually. You 
have 3 Word mode options — String, Whole Words and Lemmas — as shown below: 


If the text contains STRING WHOLE WORDS LEMMAS 


simple 
characters 


Tested Yes (it contains the word 
“test’) 


Yes (it contains the word 


Protest 


"test") 


Considers words as a 
sequence 


The word may not be Uses the tokenizer 

of preceded or followed by accept Sea 
letters variants of the same word 
Yes Yes 


No (only the word “test” is 


Yes (if using the English 


accepted) tokenizer) 
No (only the word "test" is No (protest is not a 
accepted) grammatical variant of "test’) 


If you are wondering why there are so many options, let's look at the examples below — where there is not only a word 
but also 2 strings of words to be searched — as it is so much easier to show than to explain! 


SEARCH WITH EXPRESSION MODE EXACT SEARCH AND THE 3 WORD MODES 


TERM EXACT SEARCH + STRING EXACT SEARCH + EXACT SEARCH + LEMMAS 
WHOLE WORDS 


Results 


1. withdrawal 
2. Withdrawals 


1. arbitration panel rulings 
arbitration 2. arbitration panel ruling 


panel ruling 


equal 1. equal treatment 
treatment 2. unequal treatment 


102 


187 


48 


Results 
1. withdrawal 


1. arbitration panel ruling 


1. equal treatment 


88 


179 


45 


Results 
1. withdrawal 192 
2. withdrawals 
3. withdraw 
4. withdrawing 
1. arbitration panel ruling 250 


2. arbitration panel rulings 

3. arbitration panel to rule 

4. arbitration panel rules 

1. equal treatment 45 


SEARCH WITH EXPRESSION MODE KEYWORD SEARCH AND THE 3 WORD MODES 


TERM KEYWORD SEARCH + KEYWORD SEARCH + KEYWORD SEARCH + LEMMAS 
STRINGS WHOLE WORDS 


Results 


1. withdrawal 
2. Withdrawals 


1. arbitration panel ruling 

2. arbitration panel rulings 

3. arbitration panel to rule 

4. arbitration panel rules 

5. rulings of the arbitration 
panel 

6. arbitration panel 
decisions and rulings 


caual 1. equal treatment 
q 2. unequal treatment 
treatment : 
3. its treatment is equal 


arbitration 
panel ruling 


102 


419 


51 


Results 
1. withdrawal 


1. arbitration panel ruling 

2. arbitration panel ...ruling 

3. ruling of the arbitration 
panel 


1. equal treatment 
2. its treatment is equal 


8 


88 


378 


47 


Results 
1. withdrawal 192 
2. withdrawals 
3. withdraw 
4. withdrawing 
1. arbitration panel ruling 537 


2. arbitration panel rulings 

3. arbitration panel to rule 

4. arbitration panel rules 

5. rulings of the arbitration panel 

6. arbitration panel decisions and rulings 

7. arbitration panel... Rules of Procedure 

8. arbitration panels ...customary rules 

1. equal treatment 48 
2. equality of treatment 

3. its treatment is equal 
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G.1.4. Search options @ 


You can search by Translated, Untranslated and Translated and Untranslated (the default) segments. This applies, of 
course, to searches that include the project memory (Memory) as in the external memories all the segments are 
translated. 


You can also see just one occurrence (the default) and the number of identical segments found in the memories or you 
can see all duplicate segments by unticking Remove Duplicates if you want to see where those segments came from. 


G.1.5. Search scope ® 


You can search in all — source files, project memory, external memories and glossaries — at the same time or restrict 
the search to one or some. 


The project memory segments have priority over the segments from the external translation memories (if that option is 
also selected) and are displayed first (including orphan segments). 


So, you can select where OT will search for the term/strings in the In source, In translation or In notes fields: 


y Inthe Source files of your project : "translation" and "note" fields also search in what is in the source file itself - 
So it is only useful for bilingual documents, like PO localization files. 


y Inthe Memory — i.e. the memory of your project — including or not translated segments, depending on your 
selection in the Search options. 


In the screenshot below is an example of a search with Search scope — Memory — in which the search was 
carried out only in the project memory where the segments translated for that particular project are saved. 


The results are displayed with the respective segment number in your project. By clicking on that number, OT 
will jump to the Editor and open that segment for editing. 
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| 16340>) - 20/02/16 09:21 - Source: <--> - Translator: <> - Created by: <machame> 

-> ORI “customs legislation” means any legal or requiatory provisions applicable in the territories of the Parties, governing the impor, export anc transit of 
goods and their placing under any other customs regime or procedure, including measures of prohibition, restriction and control; 

-> TRA «Legisiag¢&o aduaneira», as disposicdes legisiativas ou reguiamentares aplicavels nos territorios das Partes que regem a importago, a exportagdo, o 
transite de mercadorias @ a sua sujei¢Bo a qualquer regime ou procedimento aduaneiro, inciuindo medidas de proibi¢So, restri¢fo @ controlo; 


16348>) - 20/02/15 09:24 - Source: <--> - Translator: <> - Created by: <machame> 
~> ORI: “operation im breach of customs legislation” means any violation or attempted violation of customs legislation. 
-> TRA: «Operacdes contréarias 4 tegislag¢do aduaneira». todas as violagdes ou tentativas de viclac&o da legisia¢&o aduaneira 


16352>) - 20/02/15 09:26 - Source: <-> - Transiator: <> - Created by: <machame> 
-> OR: The Parties shall assist each other, in the areas within their competence, in the manner and under the conditions laid down in this Protocol, to ensure 
\the correct application of the customs legisiation, in particular by preventing, investigating anc combating operations in breach of that legislation 
-> TRA& As Partes devem prestar-se assisténcia mutua, no ambito das suas competéncias, segundo as modalidades e as condi¢des previstas no presente 
Protocolo, tendo em vista assegurar a correta aplicacdo da legisiacdo aduaneira, nomeadamente atraves da prevencao. investigacdo e repressdo de operacdes 
contrarias a essa legisiacéo 
Gon One| 


Screenshot 72 — Search window display without the Advanced Options and the search restricted to the project 
memory and to Exact search by strings. 


er 
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In the screenshot below, you can see the same search but this time in the external memories only — the tmx files in the 
\tm folder of your project — and the full Search interface with the Advanced Options is displayed. 
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Yim cores wer =) Memecue 
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[w Rares Guptc ates Freretated Urverateted @ Tranateted ot ut ersteted | 


Source fee [ oMerncre it) tM Chonan oe fee or fotiher mamma = | steppin ] 
eeaocb er . = and = 
V eernysientn wee unter - mrtg = Nera ] 
Prerteed ut wrote baray megrrmneen 1.208 
Author: wy? -\ pemense | treretetor: . ~ | Seomense } 
noryed efter WUE Pe Bh aS . Chenged betwee WORE si" 1 at 
Ped rarcteny eeytmets as : : - | Seeren Advanced Optom 
TM: 2-AGRECMENTS-FUR-LE Xvaw-GEORGU-22014A0830-02-fN-PT-AL tmx) - 01/01/14 OF:00 - Gource! <-2014-220 14A0030(02)> - Transtiator <> - e 


Created by: <ALIGNI> 

> OR! 1. The Parties agree that their respective trade and customs fegistation. as a matier of principle. shall be stable and comprehensive. as well as that 
the provisions and the procedures shall be proportionate. transparent. predictabie. nom discriminatory, impartial and applied uniformly and effectively and wilt. 
imier alia: 

-» TRA: 1. Aa Partes acordam em Que 6% respetives legisiag6es en materia Comercio! e sduanelira, Por Uma Questo Ge principio, Gevern ser estavets @ 
abrangentes, © Que as disposigSes © procedimentos Gevem ser proporcionais, Lansporenies, previsivels, no diecreninatorios, imperciah @ aplicedos de forma 
iuniforme © efetiva, devenco, Gesignadamente 


TM -2-AGRECMENTS-CUR-LEXvaw-GEORGH-220 1 440830-02-LN-PT-AL tmx) - O1/01/14 OF 00 ~- Source: <-2014-220 14A0030(02)> - Transistor <> - 
Created by: <ALIGN!> 

> ORE 1, The Parties shail assist each other, in the areas of their competence. in the manner and under the conditions laid Gown in this Protocol. to ensure 
the correct application of their customs legistation. in particular by preventing, investigating and combating operations th breach of that legisiation 

<> TRA 1. As Parties presiam-se assisténcia miNua. No aMDbITO das suas Competencias. segundo as Modalidades @ as Condi¢des previsias no presente 
Protocolo, tende em vista essegurer a correla apecechc do sue legisie¢do eduaneira, nomeadamente atraves dao prevengéo, investigecée @ repressio de 
operasdces contartas a essa legisiagio 


Screenshot 73 — Search window display with the Advanced Options and the search restricted to the external 
translation memories (TMs) and to Exact search by strings. 


y In the Glossaries, thereby allowing you to use the Expression and Word modes to restrict or widen your 
search and obtaining more or less results than those displayed in the Glossary pane. 


2) customs legistation ~ OmegaT | mmm | (Ena) ox 
Text to search 

|| In source || NOT custome legislation 

|¥l In translation | | NOT : 

[V) an notes [| not ; 


| Soarmme teat in oll fields /AND @ OR 
Expression mode 
@ Oxact search (_) Keyword search ‘ Regular expressions 


Search options 


|) Remove duplicates (_) Translated ( Untranslated (@) Transiated or untransiated 


Search scope 
|] Seurce fites [| Memory ||) TMs |) Glossaries [| File or folder name: 


ir of matching segments: & 


»| Glossary>) - - Source: <--> - Translator: <> - Created by: 
-> ORI: Customs Legislation Committee 
-> TRA: CLA 


Glossary>)- - Source: <--> - Translator: <> - Created by: 
-> OR: Customs Legisiation Committee 
-> TRA: Comité de Legisiagao Aduaneira 


Glossary>)- - Source: <--> - Translator: <> - Created by: 
-> ORE Customs Legisiation Committee 
-> TRA: Comite de Legisiag&o Aduaneira (CLA) 


Gliossary>)- - Source: <--> - Translator: <> - Created by: <> 
-> OR: Working Party on Customs Union (Customs Legisiation and Policy) 
-> TRA: Grupo da Uni&o Aduaneira (Legisia¢&o e Politica Aduaneiras) 


Glossary>)- - Source: <--> - Translator: <> - Created by: <> 
-> ORI: correct application of customs legislation 


— aes a -+ -- 


Screenshot 74 — Search in glossaries 
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y Inthe File or Folder that you may have selected by filling in its name in the File or folder name field. 


The name of the folder must be exactly its name (without TM:) as seen in the example below — 
2-Norm-Memory — in which are only displayed the results from the Normative Memory folder. 


vip) Gurepeen Reseorch infrastructure Comsortuen - Omega 


Same tert aS felds (oO 2 OK 


of matiterg mcprreente: 74 i Searcy fi. Meencns Covers | 
TMi 2-Norm-Memory\RT 0-20 13-80050-00-00-EN-ORI-00_CelexEN-PT-AL tnx) - 30/09/13 13.44 - Source; <—> - Transiator. <> - Created by: <ALIGNI> 

> ORI Counce Regulation (EC) No 723/2008 of 26 June 2009 on the Community legal framework for a European Research Infrastructure Consortium 3 
(ERIC) 

-> TRA: Regulamento (CE) n.o 7232/2009 do Conseiho, de 25 de Junho de 2009 . relative a0 quadro juriaico comunitario apucavel a0 Consorcio para uma 
infra-estrutura Europeta de investigagdo (ERIC) 


Th{z-Nom-Memory}8 1 0-2013-80080-00-00-EN-ORL-00_EN-PT-RET tmx>) - 27/01/12 16:06 - Source: <RT0-2011-800750100> - Transiator- <machame> - 
Created by: <adierja> 

~> ORI: A European Research Infrastructure Consortium for the Common Language Resources and Technology Infrastructure named CLARIN ERIC is 
hereby established 

~> TRA: E estabelecido o Consorcio pare uma Infreestrutura Europeia de investigasao relative 4 Infraestrutura Comum de Tecnologias ¢ Recursos 
Linguisticos, designado CLARIN-ERIC 


ay ST TO-2013-80050-00-00-EN-ORI-00_EN-PT-RET tnx) - 10/03/12 15.46 - Source; <-2012-3201200136> - Translator. <> - Created by: 
<mask 


> ORI: A Buropean Research Infrastructure Consortium for the Common Language Resources and Technology infrastructure named CLARIN ERIC is 
hereby established. 

-> TRA: E estabelecido © Consorcio para uma infraestrutura Europela de investigagdo relative a Infraostrutura Comum de Tecnologias o Recursos 
Linguisticos. designado CLARIN-ERIC 


TM)2-Norm-Mamory\ik T0-2013-80050-00-00-EN-ORI-00_EN-PT-RET tmx>) - 10/02/12 15°46 - Souree’ <-2012-32012001236> - Transintor <> - Created by: 
<master> 

> OR! CLARIN shall have the legal form of a European Research infrastructure Consortium (ERIC) incorporated uncer the provision of Regulation (EC) 
No 7223/2005 and be named "CLARIN ERIC’ 

«> TRA A Infraestrutura CLARIN assume a forma juridica de um Consércio para uma Infraestrutura Europeia de investigacSo (ERIC), criade ao abrigo das 


them nonin Ben stm Bint (OE «POR OAO nn stele enna ceo Ml ABIL SOI. 
ewe ~e te tomer | | 


Screenshot 75 — Search limiting it to the 2-Norm-Memory subfolder in the \tm folder 


If it is a search limited to a single file, it must be the full name of the file (path included if it is a file in a subfolder 
of the \tm folder), extension included as in the example below: 1-IWAIN-REFERENCES\NoDG-2009- 
32009R0723_EN-PT-DWN.tmx. If the tmx file is not in a subfolder, it is only necessary to have the name of the 
file (in this case it would be: NoDG-2009-32009R0723_EN-PT-DWN.tmx). 


MOT eurapean research infrastructure commormen 


Sartve terete alt fekde Am @ On 


| 7) Source Mes  Memnery 9 Tee 1 
Dr of mati temg segments» = 
TN] 1-MAIN-REF ERENCE S\INGDG-2009-3200SRO723_EN-PT-OWN | 


caivin’ 4 — = =: ee pre fee Lusearch yn | palaneed Opnens..; 
>) - 17/11/09 15:31 - Source; <-2009-32009R0723> - Translator. <> - Created By = 


> ORE Council Regutation (EC) No 7232/2009 of 25 June 2009 on the Commumty legal framework for a European Research infrastructure Consortium 
(ERIC) 

-> TRA: Regulaments (CL) 9.° 7223/2009 do Conseins ce 25 de Junho de 2009 relative ao quadre juridice comunkario aplicavel ao Consércio para uma . 
infra-eatrutura Europeia de Investigagio (ERIC) 


7 >) = 17/11/09 15:31 - Source: <.2009-32009R0723> - Translator: <> - Created by 
<saferd> 

-> ORE in contrast to Joint Technology initiatives (JT!I) constituted as Joint Undertakings of which the Community is a member and to which It makes financial 
comtributions, 4 Guropean Research infrastructure Consortium (hereinafter referred to as “HE RIC") should not be conceived as a Community body withan the 
meaning of Aricie 185 of Council Regulation (EC, Euratom) No 1605/2002 of 25 June 2002 on the Financial Reguiation applicable to the genera! bucget of the 
European Communities [6] (the Financial Regulation), but as a legal entity of which the Community is not necessarily a member and to which it dees not make 
financial comributions withwn the meaning of Article 108(2)(f). of the Financial Reguiation 

-> TRA: Em contraste com os iniciotivas tecnolégicas conpuntos (ITC) constituidas como empresas comuns de que a Comunidade @ membro e pore a qual = 


Screenshot 76 — Search limiting it to the 2-Norm-Memory subfolder in the \tm folder 
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G.1.6. Memorize searches B@ 


This is a brand new DGT-OT feature. It applies to several fields as can be seen in the screenshot below. 


r 


Lael Text Search it~ <<. * <a.) 2. «ci ii. 
Text to search 


[rem display template 


~ In source NOT mise | ~ |. Memorze |) 
¥ In transtation NOT For Uus session ~ | Memorze 
¥)\ in notes | | NOT | For the project 7 | Memorize 
Some tent in all fields AND @ on For of projects | 
‘Expression mode a a = i Cancel | 
Dxact search Keyword search @ Regular expressions Cate senuitive I S Partial Vgment TT Whote words Full 
Al OR S98 9960068 —$a $a _$ $$ S———————————— ——~~—~—-—~—~- —— 
| ~ Remove duplicates Translated Untronsioted © Translated or untranglated 
; Search scope —— . _ : 
Source files “ Memory TMs Glossaries Pile or folder nome: ~ 


Template variables: S(preoamive - mince Configure format | be 


Number of matching segments: 31,000 ) 
) 
Author: ~ | Memorize |_| Tronstotor: NOT ~ Memorize | 
||| Changed ofter: oo/t : Changed before: 2/01/14 16:31 ’ J 


Search Advanced Options 


Screenshot 77 — Memorize feature 


You can memorize your searched terms by pressing the button Memorize in the field you want and selecting if you want 
to memorize the term/expression only For the current session (i.e., it will be deleted after you close the project), For 


the project (and thereby having it available for that project no matter how many times you close and reopen it) or For all 
the projects you translate using OT. 


This feature is available for the fields: In Source, In translation, In notes, File or folder name, Author and Translator. 


You can memorize words/strings and, particularly useful, Regular Expressions. In Section G.5. is available the list of 
regular expressions already memorized for the In Source field. As the feature memorize is field-dependent, if you want 
you can copy those regular expressions to other fields ... or write new ones if you know how! 


G1.7. Author and translator B® 


In DGT, there is a difference between the author and the translator in the external memories attributes. 


The author is the person who has made the alignment of the source and target documents, which may be an assistant or 


a translator or an automatic alignment without human checking. The translator is the one defined as such in a dedicated 
field in the tmx files stored in Euramis. 


Search of “spectrum” In Source 


Search of “Spectrum” In source with Translator: matosca 
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Screenshot 78 — Search with and without limiting it in the Translator field 
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In DGT released documents, the name of the translator may be missing if a post-alignment is made and that information 
was not recorded in the tmx file. 


There is always an author (of the alignment), but for published legislation there is never a translator. 
In the project memory — and the memories generated from it — the author and the translator are the same of course. 


This difference may be important when you have many external memories and you want to search by the login of the 
translator who may be the “reference” for a particular domain. 


G.1.8. Search by date: changed after and changed before 
In the Changed after and Changed before fields you can set the data and hour. And of course you can combine it with 
other fields. 


In the example below, the search is made for the term “bands”, translated by matosca, after a certain date thereby 
restricting the search from 127 to 4 results. 


Search In source of the term “bands” without other Search In source of the term “bands” with the 
search criteria Translator matosca and after 9/1/2010 
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Screenshot 79 — Search in the external memories by date combined with Translator 


G.1.9. Number of matching segments and indication of the number 
of results 


You can define the number of matching segments you want OT to display in the Search pane. You can increase it — to 
several thousands — if you want, for instance, to count occurrences of several ways of translating a source terms into 
the target language. 


OT indicates the number of matching segments found. 
If you select a (very) high number, it may slow down the search ... but at an acceptable level. By default it is set to 1,000. 


This may save you a lot of time as you can quickly determine which are the frequent/infrequent translations of a term/string if 
you have a large number of reference memories. 


am 
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G.2. Filter searched segments o 


In fact, the Search feature itself is already a filter as you can limit the search in several ways. 


So filtering in OT really refers to the editing of the searched terms/strings in the project memory and therefore it is — 
obviously — only applicable to searches in the project memory. 


You can, in a single operation, search for words/strings/regular expressions as explained in the previous sections and, 
using the search results, filter those segments for editing. 


Just click on Filter at the bottom of the Search window. Those filtered segments will be displayed in the Editor and you 
can modify them. 


Anew OT feature is that, to return to the full document and continue working normally, you click on Remove filter, which 
is now at the top of the Editor. No need to go back to the Search window (which you might have already closed down!) 
as was the case in previous OT versions. 
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Screenshot 80 — Filtering — Editor pane displaying for editing the filtered segments selected with the Search 
feature in the whole project 
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G.3. Search and replace me 


You can do a Replace Interactive (one by one) or a Replace all in all the documents of your project using the same 
options as in Search Project (exact search, keyword search, regular expression, case sensitive, strings, whole words 
and lemmas). 


To search/replace a term/string, just highlight in the Editor the target text you want to replace, press Ctrl+K (or click on 
Search and Replace in the Edit menu or use the icon) and in the field Replacement type the text you want to replace it 
with. 
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258> 
No entanto, os utilizadores ou empresas em digressdo que desejem oferecer os seus servigos audio no estrangeiro necessitam de utilizar equipamentos que 
possam funcionar em faixas de frequéncias diferentes om cada um dos Estados-Membros ou utilizam equipamentos mais dispencioses com maiores 
icapacidades de sintonizacdo 
No entanto, os utilizadores ou empresas em digressdo que desejem oferecer os seus services audio no estrangeiro necessitam de utilizar equipamentos que 
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Screenshot 81 — Search displaying matching segments. Option to Replace all 


You can “preview” the segments affected. OT will display the segments affected: as they are in the project memory (in 
red) and as they will be after the replacement (in blue). 


To launch the replacement operation, just click on Replace all or Replace Interactive at the bottom of the 
Search/Replace window. 


™ It is strongly recommended that — if you choose the option Replace all — you have a look at the results displayed to 


check that your settings do not included unwanted segments as in OT there is no Undo/Redo option for this 


operation. Therefore, be extremely careful! 


To be on the safe side, do a Save (Ctri+S) before doing a Replace Interactive/all as, in extremis, if you make a 
really BIG mistake in this operation you can use the last backup of your project memory stored in the project 
\omegat subfolder and not lose any work you did before the replacement operation. If you forget to do it, don’t 
worry, you can always use the last backup without losing much work as OT does automatic backups every 3 
minutes. 


® Part P on Troubleshooting. 


oe 
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If you select the option Replace Interactive, just click on Skip — if you don’t want to replace it — or Replace Next if you 
want to replace it. When you finish, click on Finish and OT will display the Editor window with the segment last replaced 
open. 


Laas ajett fn Gote View Tents Gpenern pres 


Sica coc QKIAVS TO AO RES 

Manchin Tonvalaliee : c-) 

Aptcacées @ucio sem fics, migresB de PMSE com oulras tecrotogiae e/ou bancas protetoras. st 

. 

| tier = CRC ROiN-eeaaaind-atenTRAee DOCK -c 
(Se | Reece net | orien 


| Translation last moaiied by mecharre on 20-Jun-2014 at 09-42'97 me || 
“Migration of wireless audio PMSE applications to other technologies and/or bands. 
| anaeene actus “TRA™ > 

| Migraglo de outras apticacSes PMSE fudio sem fics para outras tecnciogias e/ou MiNi. 
<end segment 


| Migrag io de outras aplicegSes PMSE duciio seen flies para ovtras 
‘tecnotogian @fou faces 
_Migrog4o de ovtras aplcagSes PMSE audio sem fos para ovtras . 
Seine Vee tyennen torre y SNe technology in nearby bands, where wireless audio | ec nofegias e/ov bendes 

_PMSE operates on @ non-protection and non-inerterence 
Pe er Prejudiciels cavsoces pein tecnoiogis LTE nas edjacentes, quand> as 
aplice¢ées PIMSE Gudic sem fics funcionam numa base de néc-preteciio e de nilo-interferéncia, 


Praagconarletch papenaatap pnctinnd Prenat eareareay ahbeporstaa hotahag 
Legaipinons biok aon spoeate tn ermerent & peers Soon oF 

povboye neon OF USe MOF expensive equipment with wicer tuning capabitities, 

No entento, of ublzacores ou empresas em digrestdo que Sesejem oferecer os seus services 

UGIO ND estrangeiro necesstiom de Utkzar equipamentos cue possem funcioner em MRE de 

frequéncias diferentes em cada urn dos Estados-Membros ov utlizamn equipemertos mais 

@spenciosos com muiores capacidades de sintonizacho. 


Heaapsaypabtaceadycedh y to lose availability of white spaces in the UHF band, and wilt 
|nave to migrate to other pacbavdnoarciy encapclonhy 

& provevel que os services PIMSE Geixem de dispor de espaccs Drancos na fame UF. pelo 
Que, a lorgo prezo, terSo de migrar pare ovtras tecncliogies/cu 


(Member States remain free to make available spectrum in addition to the 60 MHz either No entanto. os utilizadores ou empresas em digressio que detejem 

(ri thee samme Wequency ranges or’ in others, jOferecer 05 S0US Services dudiio NO estrangero Necesaltam de ublizar 

Os Estodos-Mernoros continue ter a Iberdede de deponilizar espetro para atom cos lequipamenos que possan funcionar em bandas de froquéncias 

60 Miz quer nas mesmas (RRB se trequencia quer novres «| diferentes em cada um dos Estacos-Membros ou utilzam equipamentos 
ccpmery Comment Gossey imetme trancatmas = «hotee Mas SEpENtosos com MaOCres CapaCKiades Ce smtorazacso, be 
mamerwed en 1 [Gane] Reptoce enters | Replace of Pore Sal. 


237> 

Imerferéncios prejudicials cousacas pela tecnotogia LTE nas feixas 
adjacontes, quando as apSicacdes PMSE audio sem fos functonam numa 
base de nGo-protecdo ¢ de nbo-interteréncia 

Interteréncias prejudiciais Causacas pela tecnctogia LTE nas bandas 
ecpacentes, quando es apicacdes PMSE audio sem fics funcionam numa 
base ce n&o-protegho @ de nBc-imerierencia 

258> 

No entanto on utlizacdores Ou empresas om digrensto que desejem 
Oferecer 08 SOUB SErVKoS udic NO eStfangero Necesstan Ge Ublizar 
(@Quipamemios que possem funcionar em faixes de frequéenciss 
diferentes em cace um dos Estacos-Membros ou Utilizem equipamentcs 

| mals diapendosos com molores capackiedes de sintonizacBo. 


Screenshot 82 — Filter option: Replace Interactive (one by one) 
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G.4. Search in monolingual reference documents — 
Search Directory ™® 


You can search monolingual documents in the formats accepted by OmegaT (e.g. Office and PDF) using the Search 
Directory feature. Search Directory does not accept tmx files. 


In DGT-Omegaf, this search is separated from the Search Project feature and has a dedicated window that you can 
access via the Edit — Search Directory menu or with the Ctrl+Shift+K shortcut. 


To use this feature, just: 


1— In Windows Explorer, copy your reference document(s) to a new subfolder that you create in your project or 
anywhere in your computer 


2—  InDGT-OT, highlight the term to be search and press Ctrl+Shift+K 
3— __ Inthe Location field, click on Select Folder to browse to the folder where you have your reference document(s) 


OT will remember that location in future searches. However, when you close and reopen that project, you will have to 
select the location again 


If you have Recursive search checked, it will also search for documents in subfolders of the selected folder. 

This search is quite fast ... as long as you don’t have an enormous amount of reference documents! 

& Very useful if you have monolingual reference material in the target language. It saves time as you can search 
directly in OT if a particular term is used in national legislation, for instance. 


As with Search Project, you have a number of options to restrict or limit your search. The options available are, mutatis 
mutantis, as explained in the previous sections. 


Test to search 


 Lawet search Keyrrord search Keguier expresaens Case sensative | @ Strings Whole words Lemmas ] 


Formptate variabies: £ (pr earmnte wa Centqure format ] 


lr FY SPRAY COTS ATS 
Lamber at matching segement= soo > 
Mr Of (matching segments: 2@ Seaeats Adve Cptnenre 


NATIONAL-PLAN docx ~ 
2013 (seguidamente designado «Programa-Quadro Horizonte 2020») visa obter um maior impacto na investigacao e 
inovagSo mediante uma contribui¢So para o refor¢o das parcerias publico-publicas, nomeadamente com a 

participac3o da Unido em programas empreendidos por varios Estados-Membros em conformidade com o disposto 

no artigo 185.” do Tratado. 


NATIONAL-PLAN docx 

A Unio proporcionara incentivos para melhorar a coordenagSo, garantir sinergias com as politicas da VE, contribuir 
para essas politicas e para as pricridades do Programa-Quadro Horizonte 2020, acormpanhar a execug3o do 
programa e assegurar a protecSo dos interesses financeiros da UE. 


NATIONAL-PLAN docx 

A participagSo em aces indiretas financiadas pelo Programa EMPIR est4 sujeita 4s disposicSes do Regulamento 
(UE) n.* .../2013 do Parlamento Europeu e do Conselho, de de 2013 que estabelece as Regras de Participacao e 
Difus3o relativas a0 «Horizonte 2020 — Prograrna Quadro de Investigagao e Inovag3e (2014-2020)». 


NATIONAL-PLAN docx 
a er ee ee FO Oe Ee ee | ES Se Deere 
| Close 


Screenshot 83 — Search in monolingual reference documents 


1 | 
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G.5. Regular Expressions ae 


In theoretical computer science and formal language theory, a regular expression (abbreviated regex or regexp and 
sometimes called a rational expression) is a sequence of characters that forms a search pattern, mainly for use in pattern 
matching with strings, or string matching, i.e. "find and replace"-like operations. 


® the article in Wikipedia and Chapter 17 of the public OT Help/Guide for an overview of RegEx. 
In DGT-OT, you can use RegEx in Search Project, Search Directory, Search and Replace and Search and Translate. 
Regular Expressions can be very useful if you know how to use them ... and if you are careful! 


™ If you use them in Search and Replace all or in Search and Translate, it is highly recommended that you have a 
look at the results displayed to check that your settings do not included unwanted segments. The same 
recommendation applies too: do a Save before launching the replacement operation so that you can use a project 
memory backup if something goes wrong. 


Regular Expressions work only in string mode, implicitly. For that reason, when you select that option, the Words mode box 
disappears and new options are displayed: 


y Partial segment: accept any segment containing the regular expression, even if there is something before and after 


yY Full segment: the segment matches the regular expression, with nothing before or after. This is equivalent to 
adding “ at the beginning and $ at the end of the expression 


yY Whole words: equivalent to adding \b at the beginning and end of the segment 


Tent to search — 


| source not 
¥) ih trenstation norT 
Jv) in notes NOT 


Same test in of fiekis 


Dogression + mode - : — — ; — re - — Regular expression n made — 
| Cxect search Koywerd search @ Rogular exprascmons @ Pertial segment Véhole words 


Screenshot 84 — Regular expression mode 


To make your life easier, in DGT-OT some regular expressions are memorized and given a more or less understandable 
name. 


= gosit\s 
BStteneeis co zi- 


[ie Remove Copecotee Vi eveteces Unirenattes @ Traneieted a Careneteces | 

heer nh wape — 
leone ats eelmookey eS ion ExlUaasine | live Amma ae ~ Luana 
Pe of meatcheny cegerents: &F Sear Adewrned Optrerre 
| 
| 86>) - 04/02/15 11.01 - Source: <—» - Translator <> - Created by. <meaechame> = 
| > OR! Marble. ravertine. ecaussine and other colcareous monumental or Duliding stone of an apparent specific gravly of 2,5 or 
|More. and alabaster, whether or not roughly tammed or merely cut. by Sawing oF otherwise, into blocks or slabs of a rectangular 


jereone aquare) shape 
>» TRA: Marmores. travertinos. granites Deiges « outras pecras caicarias de cantaria ou de conmstrucéo. de Gensidade aparente 


ligual Ou Superior © 2.5. & Slebestro, Mesmo Gesbestados ov simplesmente cortedos 6 Serre Ov Por OVTro Melo, em blocos ov places 
|de@ forma quadrade ou retangular 


|Zoa>) 04/02/16 14:39 « Source: <=> « Transiator <> « Created by: <machame> 

-~ ORI: ---- Mixtures consisting mainiy of (S-ethyi-2 +“methyl-2-oxido-1.3.2-dloxaphospninan-S-ytenethy! methy! methyiphosphonate 
lene bial( S-ethyl-2-erneth y!-2-ox)do-1 3. 2-diozephosphinan-S-ytrnethy!] methyiphosphnonete, and mixtures consisting mainmy of dirmetiny! 
|methyipnosphonate. oxitane and diphosphorus pentacxide 

= TRA Mistures Comstituidas principaimente por metilfostonato de (6-etil-2-metit-2-cudo- 1. 3.2-diokafosfinan-5-il}metiimetiio @ 
Metiifoefonats de bie{(S-etil-2-metil-2-oxtdo-1,3,2-dioxafosfinan-6-il metic), © misturas comstitulidas principalmnente por metilfoefonato 
Ge AimHatio. CxIFANOS @ PEeMtonidoe de aifostora 
an a eS a erecta 
|418) - O4/02/165 14:42 - Source. =—> - Translator: ~~ - Created by: ~machamer 
| > ORI. - Fatty-ectd mono-alky! esters, containing by volurne 06.5 % of more of esters (FAMAE) 
> TRA: - Esteres monosiquilicos de écicdios gordos (FAMAE). ave contenham, em volume. 96.6 % ou mals de osteres 


561) - 04/02/15 15.41 - Source <--> - Translator, <> - Created by. <mechemre> 
> OR! Of a density exceeding 0.5 g/crn<tO/>3<11/> Dut mot exceeding 0.8 gram <12/> 3<13/> 
-~ TRA -- Com densidade superior a 0.5 g/cm <tO/>2<t1/> mas nic superior # 0.8 gfcm* 


|1249>) - 68/02/15 13:47 - Source: <--> - Transiator: <> - Creates by: <mechame> 

-* ORI: - Non-alloy pig Hon containing by weight more than 0.5 % of phosphorus 

| -& TRA: - Ferre fundido brute nde igade, que contenhs, em peso, mais de 0.5 % de fosfore 
Paes oe a ae 


= met 


Screenshot 85 — An example ot Search with a Regular Expression: Decimal numbers (2 decimal places) separated 
by comma (,) 


1) 
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The Regular Expressions in the list below are memorized for Search Project in the In Source field. They will be 
displayed when the Expression mode is set to Regular Expressions and you press the select button to display the 


dropdown menu. 


As the Memorize feature is field-dependent, write/copy the regular expressions you need to other fields in the Search 
Project, Search/Replace, Search Directory or Search and Translate windows. 


If you want to write your own RegEx, you can include a description. For that, use the vertical bar “|” followed by the 
RegEx (like those memorized), making sure that it is not a string that is likely to happen in any text. 


To memorize Regular Expressions, just write or copy the RegEx you want in the field you want, click on Memorize for 
that field and select if you want to save it For this session, For the project or For all projects. 


Regex memorized | Whatitfinds 


Regular expression 
fld}{2,},[\d}{1,2} 


Nd}{1,}\.[\d}{1,2} 
[ld}{2,}\.[\d]{1,2}% 


[\d}{1,2}1\.— 
VINd]{1,2}11.- 
VINd}{4} 


[\d}{1,}% 


[A-Za-z]+ 


A\([a-zA-Z]{1,4}\) 


|[Aa-zA-Z]+$ 
\({ld}+\) 
[ldl\.\s,<>tl}+ 


f\d}{2.,}- 
AlNa-zA-Z}*$ 
fld}{1,}\s[\d}{3} 
\(hd}{1,}\s[\d]{3}\) 
\s\st 
\b[A-Za-z]+\b $ 


(\b\w+\b)\sld\b. 
A\b[A-Za-z]+\b 


1-2DECIMALScomma 
1-2DECIMALSdot 


fle 
2DECIMALSdotPERCENTAGE 


DATES 


DIGITSall 


DiGITSpercentages 


LETTERSall 


LETTERSstartBRACKETS 


NoLETTERS 
NUMBERSbrackets 
NUMBERSdotCOMMAtagSPACE 


NUMBERShyphen 
NUMBERSonly 
NUMBERSseparSPACE 
NUMBERSseparSPACEbrackets 
SPACEdouble 

WORDend 


WORDSrepeated 
WORDstart 


Decimal numbers (2 decimal places) separated by comma (,) 
Decimal numbers (2 decimal places) separated by dot (.) 


Decimal numbers (2 decimal places) separated by dot (.) immediately followed by % 


Dates in following formats: (D)D.MM.YYYY, (D)D-MM-YYYY and (D)DIMMIYYYY 
At the beginning of segment : add “at the beginning, e.g. “[\d]}{1,2}[\.- 
VINd}{1,2}1).-V]d] {4} 

At the end of segment: add $ at the end, e.g. [\d]{1,2}[\.-W][\d]{1,2}[\.-W[ld]{4}$ 


Any number of digits 

W/ NOT ticked: only segments without digits 

W/ "Full segment" activated: segments with digits only (i.e. no spaces, punctuation, 
symbols or letters) 

W/ "Partial segment" activated: segments which possess at least one digit 


Digit(s) immediately followed by %, e.g. 5% 
With a space before %: [\d]{1,}\s% (\s caters for all kinds of spaces and tab) 


Any number of upper- and lowercase letters (i.e. no spaces, digits punctuation, 
symbols or letters from other alphabets) 

W/ NOT ticked: only segments without any letters 

W/ "Full segment" activated: segments with letters only (i.e. no spaces, punctuation, 
symbols or letters from other alphabets) 

W/ "Partial segment" activated: segments which possess at least one letter 


Segment starting with characters (case-insensitive) in brackets, e.g. (a), (i), (vi) or 


(vi) 


Numbers in brackets with no spaces 
Numbers including dots, OmegaT tags, commas, spaces in any combination. 
W/ "Full segment" activated: segments consisting only of numbers and/or dots 


and/or OmegaT tags and/or commas and/or spaces in any combination 
W/ "Partial segment" activated: not suited 


Numbers followed by a hyphen 

Segments only with numbers 

Numbers with a space between groups of digits, e.g. (10 000) 

Numbers in brackets with a space between groups of digits, e.g. (10 000) 
Segments with double space 


Segment ends with word 
W/ NOT ticked: Segment does NOT end with a word 


Segments with repeated words 


Segment starts with word 
W/ NOT ticked: Segment does NOT start with a word 


am 
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The memorized searches for the project are stored in the main folder of your project and the memorized searches for all 
projects are stored in the CONFIG-PERSONAL subfolder of the OmegaT_Projects folder in your computer, both in 
files named search.tsv. 


In the example below, you can see the file where the regular expressions are already saved by default in the 
_CONFIG-PERSONAL subfolder. 


In the Search windows you can memorize regular expressions or terms but you cannot delete them. 


If you want to delete regular expressions that you no longer want, just open this file in Notepad++, delete the lines with 
the terms/regular expressions you don’t want anymore and save it. 


wo] by] Search: CONFIG: 


fije Edit Format View Hep 


Ful lProjectSearchwindow SW_SEARCH_ SOURCE :REGEXP 1-2DECIMALScomma|[\d}{1,},[\d]{1,2} 
Ful lProjectSearchwindow SW_SEARCH_SOURCE :REGEXP 1-2DECIMALSdot| [\d){1,}\.[\d] {1,2} 
Ful lProjectSearchwindow SW_SEARCH_SOURCE :REGEXP 1-2DFCTMAL SdorPERCENTAGE | [\d]{1,}\. (\d]{1,2}% 
Ful lProjectSearchwindow SW_SEARCH_SOURCE:REGEXP DATES| E\d 141, 23E\.-\/] ENG) 41, 2A. -\/) Nd 4} 
Ful lProjectSearchwindow SW_SEARCH_SOURCE :REGEXP DIGITSall|[\dJ+ 
Ful lProjectSearchwindow SW_SEARCH_SQURCE:REGEXP DIGITSpercentages | [\d]{1, }% 
Ful lProjectSearchwindow SW_SEARCH_SOURCE :REGEXP LETTERSal] | [A-Za-7]+ 
FullProjectSearchwindow SW_SEARCH_SOURCE :REGEXP LETTERSStartBRACKETS | A\([a-7A-7]{1,4}\) 
FullProjectSearchwindow SW_SEARCH_SOURCE :REGEXP NOLETTERS|A[Aa-2A-7]+$ 
FullProjectSearchwindow SW_SEARCH_SOURCE :REGEXP NUMBERSbrackets|\({\d]+\) 
FullProjectSearchwindow SW_SEARCH_SOURCE -REGEXP NUMBERSdotCOMMAt agSPACE | [\d\. \s ,<>t\/]+ 
FullProjectSearchwindow SW_SEARCH_SOURCE :REGEXP NUMBERShyphen | (\d] (1, ) 
FullProjectSearchwindow SwW_SEARCH_ SOURCE :REGEXP NUMBERSonly[A[\d]+$ 
FullProjectSearchwindow Sw_SEARCH_SOURCE :REGEXP NUMBERSseparSPACt | [\d] may tee 
FullProjectSearchwindow SW SEARCH SOURCE :REGEXP NUMBERSseparSPActbrackets | \((\d] (1,}\s[\d) {3)\) 
FullProjectSearchiindow SW SEARCH SOURCE:REGEXP SPACEdoub lel \s\s+ 
7REGEXP WORDend|\b[A-Za-z]+\b $ 
WORDSrepeated| (\b\w+\b)\s\1\b 
WORDs tart |\b[A-Za-z]+\b 


Screenshot 86 — The search.tsv file where the memorized regular expressions are stored (in this case for all the 
projects). Here is also a term (spectrum) that has been memorized. 
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— PART H— 
SEARCH IN QUEST, DOCFINDER, 
EURAMIS AND IATE 
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In DGT-OT you have direct access to these EU/DGT applications, just by highlighting the term/string/OJ reference you 
want to search and clicking on the relevant icons or using the shortcuts. 


H.1. Search in Quest a= 


Quest is a metadata engine through which you can search several databases in a single operation. In this example, 3 
databases are selected — IATE, Euramis and Eur-Lex — and there is the indication that there are other 32 databases 
that can be selected and searched. 


Home 5 LHelp Contact © | About My profile History — | Exit 


Se Search | 


Source larget(s): [titan tea (ft 


Options @ Exact string 0 All words MACHADO Maria Jose {DGT.B.PT.1) @ Commission 
@ Tate Results 1- 100310 rem 
© turamis 
© furlex 52015X00304(03) Explanatory notes to the Combined Nomenclature of the European Union 
Other OFC 76, SAIS, p, 1-388 (BC, ES, CS, DA DE, ET, Et, tN, FR Pa, IT, OY, CT, Pal, MIT, AL, Pi, PT, BO, SK, St. Fi, SW) 
D Offine: ? 7 Direc! text access 5 fn Author European Comission 

~ . forms Mobce + Date of documeémt 04/0320! 5, Cute of publication 
D Not selected in profie: 32 
02007R1528-20141225. Council Regulateon (LC) No 1528/2007 of 20 December 2007 applying the arrangements for products | 


Originating in certain states which are part of the Afncan, Caribbean and Pacific (ACP) Group of States provided for in agreements 
establishing, or leading to the establishment of, Economic Partnership Agreements 


Direct text access Ae Author Mor available 
Form Comsolitated text Date of doxasrnere 25/12/2014 


01993R2454.70141205° Commission Requlation (FEC) No 2454/93 of 2 July 1993 laying down provisions for the implementation of 
Council Regulation (EEC) No 2913/92 establishing the Community Customs Code 


Direct text access Ge Author Not avadate 
Form Consolsated text me of docursent: 05/12/2014 


02014R03/4-20141 102: Regulation (EU) No 3/4/2014 of the Luropean Parliament and of the Counctl of 16 April 2014 on the reduction 
of elimination of customs duties on goods originaling in Ukraine 


Direct vext access (S)fae Author: Mor avavlatie 
Torm Consobdated tex! Sate of docursent 02/11/2014 


3201481 101: Commission Implementing Regulation (LU) No 1101/2014 of 16 October 2014 amending Annex | to Council Regulation 
‘ rs + | (EEC) No 2658/87 on the tariff and statistical nomenctature and on the Common Custonss Tariff ° 


Screenshot 87 — Search in Quest in this case showing results from the EU Official Journal 
In Help, there is all the information about Quest with (Quick) Guides. 


™ The search in Euramis and IATE via Quest only gives access to some of the features of those databases. If you 
want to have the full options available, access them directly using the respective icon. 
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In My profile — at the right top of the screen — you can select other databases by checking the respective boxes. 


O UT Standards Board 


Descript< 
The teteral biegual Catebace ¢ 99 espertia! ink betwees us and the Irish language treedater unt cf te teat Gowernenert who trarsate all peitaary legssiaton. 


AcadTern daabace cortare terre in LY, EM, 2, DE, FR and LA, but pramically the language coverage desends on tre languages comained in Cu vercus demoranas, pcerares arc tert lee 
edodes @ MeyiTerm, ard for Be mos 294, 2 ¢ cody (A and | 


El Coad dacerrerts repertory mn al hod languages 

DETERM ¢ 99 extract cf the terminology detahase crgmully develitged tor mhemal purposes Sy the Gerrtas Teanalation Sector (575) cf the Untied Nations Secretenat, New York. 
OG Wate doceret regery 

Contest ct EsrcTer hosted iccally 11 Quest dataonce 

SClerreac tenets. tict mvaiatie cutsde Quez. 

ESTER Fest 

Al contents of ter Lee Commmanty law catabase, mctudieg case law ard oreperatery ots 

Avtbuved cortents of the od Ear-ex Comrrancy aw Cacabase. 

HRAMIS bespeas 2everend Rubingusl efrmaten System 

[Olfbee] Ewes: Partiarnent reper repeamtory in af aFcial lavguager, RuHtent search, Th pariementary tern. 
[Ollie] Corcpeas Petigrsent reports ~epeatiory n af ofcel languages, Ubtext search. Gb parkementary toon 
Eurviec © a mutlingaa’, mutcsoptnnry thecauras cavern the actviies of the Bu, the Eurooean Fariamert in caricetr, 
Severna Corpus of benalanors of BY legalaton 

Severus Nabiingsal teemieetoy tatysese 

FAD Sernberk n Enginh, French erd Speneh 

Tigh Language Termasegy 

Toas las tarrves puids au scurta! aFoe par a Commence pinérale Ce termmrciogie et cs shsloge 

UATE « Steesichave Termnciegy ‘cr Eercpe. [ATE Ewepean rettetors lecrbert, corters of ofc languages 
Termnciope 

Tea gockeg o Coremmsics ertoese wocebclary, with efteroctve features. Wx, evolable retece Quest 

Tas Tmagiaters (adener, 9 teed dedicated ts indeeing hidden internet cessarees fer legiahs Ser mvelasic setede Quee. 
International Orpecnsabons Vocudviary. Nex availabe ovtude Quex. 

International Teleccrrmanestions Unice Tecrite temubact, corters (ngist, reach and Spans 

DG SCIC Lance Interactve Toa! 

Seiirqual temiace martened by the trandlation fier Lopes. 


‘The catudace of the Lituarar Scascarce Scare cotars tary moore [and good) techeca! terns U7, EM DE FR, LV and Fuswar languages. Most cf tuce terme crghute bon 
ttenatoral, Sovopees and Uuaner sercerds. 


Langage Ceperaert’s Maker Detspaven, Wat evaluable evtsce Quest. 

Lorerassho- 24 well as generally beck, orrt- avd pbleahor relates semncegy 

Rhaeaoancen g Sweder's ranows sem Lave on the web. B corcars appro. £27 99) term records coverng beveed Swecat, Engish, Germar and Frosh, 

The MeSH (Medea Subject ilendings! is the controled vocedulery Heseanes of De WL [US National Licey cof Medicre) WLM ss the crestor, mantener, and provider of the dete. 
Mubiegual t2emioate martared by the Fenah Terminology Covtre °K. 

Terns cortens ters mo the felis of intellemual provety, dectvec commmuncetars and auatce in Setornan, Eaglsh and Piesen. 

Puega “ermey backs 

EWRVESFT Termbank martaived ty the Caradian Federal Goverment. 

Hensh Govermert lerrbark vate 


Screenshot 88 — List of databases available via Quest 


a 
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H.2. Search in Euramis = 


You can search Euramis via Quest and have access to an interface with limited options in terms of number of results and 
other Euramis features. For instance, the only option is to search by exact string or all words. 


| About © | My profile History 


source: ED surges): EIEN 0 


Optom: @ Lxactstneqg 0 All words 


Search text; squeegee 


Memoncs laripu Mox. results 
0 EN oT w 0 second(s) 


TM: O}4 apices Your: 2014 St: EN (Direct) Trams unkown Dow, Type: Regulation 


EN: Frooms. brushes including brushes constitutiog perts of machines, PT: Vassourns & escovns, mesmo constituinds partes de méquinas, a 
aopliances or woidenh, |, hard-operated mechanical sweepers, mot = aparaihos ou Ge velovlos, vassouras mecanicas de uso manual mao 
motorised, mops ond dusters; orepered knots ond tufts for motoclzades, pincéls ¢ escanodores; cobecns oreporades para escoms, 
broom of brush malting; paint pads and rollers; sqaeegees (other than pinedis tomolhsctes; bonecas ¢ rolos pars pintta; rodeos de 
roller squeegers) ocho ou de matérias flexivels sernelhontes 


TM: Legis Joris Doe. No. 2201490830(02) Yewr: 2014 Si: EN (Direct) Trams: unkmown Dec. Type: Agreement Ao 


EN: Beocerns and trustees (except for besoers and ttw tke and bastees = PT: Vessourns @ escoves (expelo vassouwas & Semeliuntes © escoves 
made from marten of squirrel hair), hand operated mechanical floor fetes de pelo de morte cu de esquilo), vassourns mecinicas de uso 
Simeepers, not motorized, paint pads end rollers, squeegres acd mops nanual, nfo motorizedas, bomeens & folos para pinture, rolos de 
: borracha ou Go marérias Moxivels someihantes 
{ - TM: Legis-Jeris Doc. Nes 27014A0890(01) Year: 2014 51; ON (Direct) Troms: enkmnown Doc. Type: Agreement UA 
EN: Brooms and brushes (except for besoens and the like and brushes = PT: Vessourns & escoves (exceto vassourns € semelhantes & escoves 
made from marten or squirrel hair), hand-operated mechentcal floor fettas de pelo de marta ou de esquilo), vassourns mec anicas de uso 
sweepers, not motorized, point pods end rollers, squeegees end mops manual, néo motorizedas; boneces e roles para pintura, roles de 
borrachs ou de matérias foxivels semeihantes 
| TM: Legis Juris Doc. No: J2014R1101 Yeor: 2014 SL: EN (Direct) Tronss unknown Doc. Typc: Regulation OBS 


t 4 = , | | EN: Brooms, Seusnes (including brushes otmatrery Monee of machines, PI: Vassouras e excovas, mesmo conetituinds partes de maquinas, de | _ | 
CE SL ecnritimrirae exe aasdhiclaes brataewnatat sal crue tuernie ml Mowe seeenirs. ine” arwecialbwes ert cas anh thi tesserae tsi ss olay comics tree coe I 


= al Concordance - Results 
| 
‘ 


Screenshot 89 — Search in Euramis via Quest 


But you can also access Euramis directly and have all the options by clicking on the icon or pressing Ctrl+Shift+E. 


Concordance - Results 


Tearestetient Meriory Seavey 
" Home > Search > Concordance > Results ) 


q 
Search peranecters 


Year(s) 4. Method 7 Max, results : Execution time 
= Berslc 0 second(s) 


| ___Memories Seuree __Largetis) _|_eg. ew.) 


Page(s): oO More... 


Showing 30 hits / 30 (more results availatile) 
© | TM: OF Doc, Mo.: 3201480374 Year: 2014 Si: EM (Direct) Trens: unknown Doc. Type: Requiation: ORS 
ON: Brooms, Orushes (including brushes constituting parts of machines, soplances or Pr: Vinssourns ¢ oxoves, mesmo constituinde partes Ge Tm™tquinas, de aperethes cu de 
| vehicies), hand-operated mechanical floce sweepers, not motorised, moos ond feather vekulos, vassquras meciinicas de uso menual ndo motortzadas, pircitis ¢ espanodores; 
Gusters; prepared knots and tutts tor Seoom or brush making; gaint pads and roers; cabocas oreparadas para cscovas, pincets ¢ artigos semethantes; bonecas e roles para 
Squeegees (other than roller squeegees) pinturo; rodes de borracho ow de motérias Nexivels semelhactes 
TM: Legis-Juris Doc. Nos 22014063002) Year: 2014 St: ON (Direct) Trams: unknown Doc. Type: Agreement ae) 
FEN: Brooms and brushes (except tor besoms and the like and barshes mace from marten | PT: Vnssourns # escoves (exceto vassouras ¢ semeihantes ¢ escovns teitas de pelo de 
| of squirre! hair), hond-operated mechanical floor sweepers, mot motorized, paint pads end | morta ov de esquilo), wessourss mecdinicas de uso menual, nbo motceizadas; boneces ¢ 
rokers, squeegces and mops | roles pare pintura, roles de Donracha cu de matésios hexivels semeihantes 
TM: Legis-Duris Doc. Not 22014A0630(01) Year: 2014 St: EM (Direct) Trans: waniknrown Doc. Type: Agreement Ue 
EN: Brooms and brushes (except for besoms and the like and brushes made from marten | PTs Vossourns ¢ escovns [exceto wassauras ¢ Semelhantes ¢ escovas feites de pelo de 
or squirne! hair), hand-operated mechanical floor sweepers, not motorized, paint pads and = merta ou de esquilo), vessouras mec 4nicns de uso manual, nic motorivadas; bonecas 
roRers, squieegres and mops _ roles pare pintura, roles de borracha cu de matésins fexivels semeihantes J 
f ‘TM: Legis-Juris Doc. No: 32014RLI01 Year: 2014 Si: EM (Direct) Trans: unknown Doc. Type: Regulation 2 oe 
EN: Brooms, Orushes (including Srustes constituting parts cf machines, naplances or PT: Vossoures « escovns, mesmo constftuindo partes de miquinas, de aonrethos co de 
| | vehicles), hond- operated mechanical floor sweepers, not motorised, mons and feather vekulos, vassouras mecinices de uso manual ndo motorizadas, pintdis ¢ espanacores; 
"| Gusters; prepered knots and tufts tor Groom or brush making: paint pads snd rollers; _ Cabegas preporadas para escovas, pincéis ¢ artigos semelhantes; bonecas ¢ roles paro | Ss | 
rttoe//wcbgeteectescufeuena/ sun 


Screenshot 90 — Search in Euramis directly 
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In the full-fledged Euramis interface, besides being able to see a larger number of results, you can also limit the search 
in several ways. 


Concordance 


Seurth 


Senetrh mothe: @ Matic erecth © Oeact aring © fact sentence © Al words anywhere = 6 Hide ultipie hes + 


CESS. ia] 
Comnrission a 2016 + Rewerse 100 
aA 2015 = S 0 
rr 016 ® + Reverse & Indirect co 
P 2013 
8D 22 
800G wi 
CADt zoo 
CADZ vw 2009 ~ 
Subevit Save settings Help Reset simon 


Screenshot 91 — Search in Euramis directly with options 


® Euramis Help for full information on its features 


DGT-OMEGAT, ITS WIZARD WIZARD AND DGT’S CAT ENVIRONMENT —A TRANSLATOR’S GUIDE — MJM — June 2015 


H.3. Search in IATE se 


You can search IATE via Quest and have access to an interface with limited options or access it directly and have all the 
options by clicking on the icon or pressing Ctri+Shift+L. 


The direct link to IATE full interface allows you to use all the search features and also to create entries and add 
comments. 


@®D Section 1.7. for more information. 


CEE cts: CE o 


em» tnglish sq cogce Meee. oD ( 
4 pt Portuguese 1010 Se bo-rachs D a iis 

en - Enghsh squeegee COM eee Cc 

pt Portuguese rospetor + a oO 


Screenshot 92 — Search in IATE via Quest 


~~ | Commutation | | Duta Manipulition | | Communication | | Help| | Chunge Passweed lal 
* 
* 
* * 
oo wa ea 
aanegee | Sufoene Query) —— 
Your search rotumed: 16 Hes 3 | 2] Time, 0.602 
Ce | 
senor COM see oO 
ete ~ Exnyliote 
serateher ae 
fe > Feeney recheie OM one 
pt Portugeese mapedor OM eee tw 
vier c au uj 
da Damishy skraber ' 2 a8 
vvaber ' ” oO 
de German Weeher OO) ff 
> Greek sullen pas UypiuW EwpOVERT ” | | 
| em English squengee , ry ‘eq 
! coe - Spaniel ercts f “2 cc) 
fe french recente t 2 ) 
taken reece t +2 oO 
i= Menon wz 4 
12 
a = | 


Screenshot 93 — Search in IATE directly 
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In the full-fledged IATE interface, besides being able to see a larger number of results, you can also limit the search in 
several ways. 


| Consultation | | Dats Manipuletion| | Communication| | Preferences| = || Help |_| Change Password | - 


* 
— 


Looking for :queegee 


Last 
queries 


Term type ©All © Term/Phrase ©) Term Only © Phrase Only © Abbrev © Formula © rps 


Source Language Target Languages Institution 
en - English v An Any +] 
g 1 - Council 
bg - Bulgarian 2-COM 
cs - Czech i+ | 3-EP | | 
Selected values pt 
Matching © Exact Match © Exact String ® All Words © Any Word 


Include Pre-IATE results |v 


meniteperpage 10 sat 


Extend Search Criteria Hide 


Term reference 


Any 
AD - ANDORRA 

AE - UNITED ARAB EMIRATES 
AF - AFGHANISTAN 


Oriai 


(Find! 


Domain code not specified(00) 
Politics(04) 
Political framework(0406) 
Political ideology(0406001) 


Domain code Political institution(0406002) 
(Double-Click to add and remove 
items.) 
Logical operator for domains v 


Minimum Reliability Value 0 - downgraded prior to deletion ¥ 


Type of Search & Normal - (Max.5O0Ohits) © Extended - (Max.1500hits) 


= 


Screenshot 94 — Search in IATE directly with full options 
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H.4. Search in DocFinder a= 


DocFinder gives access to the EU databases shown in the dropdown menu below. 


By highlighting the reference in the Editor and clicking on the DocFinder icon or pressing Ctrl+Shift+F, you can access 
those databases. 


See pr ern en ee ge re gre we ree re cee etn re eee we re 


outubro de 2013, que abera 0 aneso | d> Reguiamecto (CEE) n° 2656/67 do Conseino relative @ nomencieura 
pauial @ estatisica 2 a pauia eduaneies comum (JO L 280 de 31.10.2013, 9. 1). 


Meuah ba pashan scas va enka Agoaeind ogaen ee om 7 
sal pat pin enone my spl 

Ganamied ee pac dtien eon oleae 

AS relerdncias 305 cosines & Ses pnecdes Cas mercacerias estho em eH com a Nomencigtura 

Nesta o- Sothartande ders cen chentegoeetes ced UE) 0,8 1001/2083 oa Comisse. of 4 ce 

SSUES de 2073, que ebere 0 reso | dc Requiarento (C55) 0 255687 do Conseno retatun 8 nomencieuna 

pauial © osiaasice & 2 pauiz aduanein comm (0D L290 09 31.19 2013, p 1) 


cm 


p Este sitio utiliza etochess (testemunihos de cose xo) 
Transition lest modified by macheme on 16-Feb-2015 at 12.2211 4 . |} para metnorar 2 sua expenéocia de navegarao Deseja |p 
osuesmnt sdapataroanapsannnnehienranaber hier cl Accept / Balu |) psiedi-tos? Aceitar / Recunsat 
{COR 
c n aan remade ; ; . |} Secenae pox wsereitader do documents * 2 
*} Descendente = 


Seutindts 1-7 de? 


: 31987R2658R(04): CORRIGENDUM TO 

Vases) ¢ area Council Requtation (EEC) Mo 2658/87 ot 23 daly 

tenement e039 1987 on the taniff and statistical nomenclature 
és and on the Common Customs Tariff 


Ohreca ext access: fl sutton Counc! of the 


faropest Unor 
Form equates Date of dxcumentt 24051588 


FGB7RIGIARIO): RECTIFICACAD Dt 
Reguiamento (CEE) n.” 2658/87 do Conselho de 
23 de Julho de 1987 relative 4 nomenclatura 
pautal ¢ estatistica e 4 pauta aduaneira comum 


Acesso direto ao texto (6) Aeter: Comnetne da Unita 


& Esropela 
Data do Gocamesto. 
Forma reqalermerto 26rth)1 982 


31987R2658R103); This document does mot exest 
in your search Language. 
OL ER AS21986, 2 32-22 OD 


31987R2658R003). Este documento nao existe 


rpraenapeprmeiy merece manson pry rel 
Cectonery Commence empty Teaemiaions «= Gownery fairy Matchen 


Peget ndenacet mt 15 0 


Screenshot 95 — DocFinder results 


@® DocFinder Help for full information about its features. 


— PART /— 
TERMINOLOGY 
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|.1. Glossaries — TransTips — Auto-completion — 
Auto-text © 


In OmegaT, using and managing glossaries are very simple operations and, considering that in DGT we have as 
terminology database IATE, simple 3-field glossaries are a sufficient complement to satisfy our needs. 


In DGT-Omegar, the display of glossary entries has been further improved. 


1.1.1. General 


Omegat allows you to use or create simple glossaries with 3 fields and it can handle very large glossaries (even with more 
than a million entries) without noticeably losing speed. 


Your (writable) glossaries can be bilingual or multilingual, as you prefer. In fact, as OT doesn’t store the language code for 
each entry, you can have any language both for source and target terms. 


You can also use the glossary — or the Notes (“@ Part J) — to collect information that might be used later to create an 
IATE entry. 


The glossary function finds not only exact matches with the glossary entry, but also inflected forms. It is not perfect, but from 
EN to other languages — which is the bulk of DGT translations — it works ... more or less. 


The glossaries for the project are, by default, in the project \glossary folder, but they can also be anywhere else you want — 
as long as they are all together in a folder (even with subfolders). 


ee This way you can share the glossaries for a given project in real time by having them in a server location while 
keeping your project in your computer. 


Glossary entries for the segment open in the Editor are displayed in the Glossary pane. They can also be displayed in a 
dropdown menu by right-clicking with the mouse on the source term in the Editor to display the TransTips (Translation 
Tips) suggestions. 


Reject fae Cele View thes Opnans vem 
, » 4s ; 
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Screenshot 96 — Glossary pane and Transtips (Translation Tips) in the Editor in a drop-down menu 
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The glossary entries can also be displayed using the Auto-completion — Glossary feature by positioning the cursor on 
the target term and pressing Ctrl+space until the option Glossary entries is displayed. 


Machine Transtation - © 0 Glossary -@0 


Deere ee me ree pie ee ee rr ee ee ee re re re 


conjunto para a era digital, bem como identificar as agdes destinadas a * | digital 1 
assegurar a interoperabilidade das redes e servicos. _) 1. digital 
“}| 2. numérico 
digital age = era digital 
digital network 
1. rede digital 
2. rede digital integrada 


| 
4 


Cooperation shail be developed in all areas related to the EU <t0/>acquis<i | 


\/> regarding the information society. digital object identifier 
1, DO! 
It shall mainly support Kosovo's gradual approximation of policies and 2. identificador de Objetos Digitais 
legislation in this sector with those of the EU. global = global | 
lobal network 5 
The Parties shall also cooperate with a view to further developing the Crome 
information society in Kosovo. 2. rede global 
3. i 
Translation last modified by machame on 09-Apr-2015 at 11:51:55 Roma avel 
Global objectives will be to prepare further society as a whole for the identifying = identificagao 
digital age, as well as identifying measures ensuring interoperability of interoperability 
noammal Tae TRE . 1. funcionamento reciproco 
2. interfuncionamento 
A cooperacao tera por objetivos globais a elaborar novas da sociedade no seu 3. interoperabilidade 


ene Dance ee bem como identificar as acdes destinadas a 4. interoperacionalidade 
assegurar a intero vow : 


canit “ poe at : a interoperability of networks = interoperabilidade das redes 
end segmen SS Se eercebaes Ones ~ | interoperability of services = interoperabilidade dos servicos = 
Dictionary Comments Multiple T 


Glossary entries 
>rojact autosaved on 11:54 Press Cirl+Page Down to go to Autotext entries suggestions. 191/1231 (3360/12097, 18258) || 168/201 
—— Press Ctrl+Page Up to return to Character table suggestions. — = — 


Screenshot 97 — Auto-complete — Glossary 


In the public Omegat is available the TaaS service (Terminology as a Service), but for the moment it is not available in 
DGT-Omegav. 


® the public OT Help or the OmegaT Guide — Chapter 20 for more information on TaaS. 


1.1.2. Glossary formats accepted 


OmegatT accepts glossaries in the following formats: 
y UTF-8 format (with the extension .txt), either for read only or read/write glossaries (writable glossary). 


y TBXas a read-only glossary (extension tbx). TBX — Term Base eXchange — the open, XML-based standard 
for exchanging structured terminological data, has been approved as an international standard by LISA and 
ISO. 


y CSV: This format is the same as the tab separated one: source term, target term. Comment fields are 
separated by a comma’',’. Strings can be enclosed by quotes ", which allows having a comma inside a string: 


Example: "This is a source term, which contains a comma’,"c’est un terme, qui contient une virgule" 


& As the UTF-8 format is the only one that can be used both for read and read/write glossaries, | suggest you adapt 
your (existing) glossaries, if any, to this format as you can at any time use them as writable glossaries. 


Ll 
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1.1.3. Glossary types 


OmegaT can have any number of read-only glossaries — even in subfolders of the project \glossary folder — but it can 
have only 1 writable glossary — also called the priority glossary. 


This glossary is, by default, named glossary.txt and is automatically created when you create the first terminology entry 
in your project. 


The results from the writable glossary are displayed first in the Glossary pane (and in bold) and in Transtips. 
So, in the \glossary folder of your OT project you can have: 


yY The IATE extraction automatically imported to your project by default (if you have not unticked the IATE box in 
the DGT-OT Wizard when creating or updating a project). 


y The writable (priority) glossary that is created for your project in the \glossary folder when you create a first 
glossary entry and which is simply named glossary.txt. 


yY Personal/Unit/Department glossaries, if any, that you would like to use in your project and which you can 
adapt to be used in OT. You will have to copy them to the project \glossary folder. 


You can also make one of these glossaries your writable glossary — as long as it is in .txt format — just by 
(re)naming it glossary. 


1.1.4. Glossary .txt format — Adapting existing glossaries 


OT accepts the following simple txt file (UTF-8) with 3 fields separated by tabs and ended by a carriage return (the last 
field is optional): 


Source term<tab>Target term<tab>3" field with whatever you like or empty<Return> 


f=] My-notebook-MJM5.ttt x] f} TERM-ESA95-European-System ofAccounts-EN-P T txt x] f) TERM-DEVCO-PRAG-GRANTS_EN-P’ 
1 accidental damage ~-danos acidentais ~~ TERM-ESA9Sii@ 
2 accounting~contabilizacdo~>TERM-ESA95 ii 
3 accounting-contabilidade —-TERM-ESA95ii 
4 accounting framework —-quadro contabilistico ~>*TERM-ESA95ii 
5 accounting matrices ——-matrizes contabilisticas -~—TERM-ESA95Sii@ 
6 accounting period——periodo contabilistico—TERM-ESA95ii? 
accrual basis~——especializagdo econémica>TERM-ESA95 ii 
accumulation accounts (III) ~—>contas de acumulacgdéo (III) »*TERM-ESA95iig 
9 acquisitions less disposals aquisigées liquidas de cessées~-TERM-ESA95iig 


Screenshot 98 — Writable glossary opened in Notepad++ for editing 


You can convert glossaries that you have — either in table or text format — and save them as a plain text (txt), UTF-8 
file. 


If you want a glossary to be your writable glossary — the glossary where you will create new entries for that project — 
the easiest way is simply to (re)name it “glossary”. 


& If, when converting your glossaries, you make a mistake and have 4 or more fields — instead of 3 — OT will read 
the 3 first fields — displaying them in the Glossary pane — and simply ignore the others. But it will not be 
“blocked”! 


190 
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1.1.5. IATE filtered extraction B@ 


When you create a project using the DGT-OT Wizard, it will automatically copy to your project — to the \glossary 
subfolder — a filtered extraction of IATE only with the entries relevant for the source terms present in the documents in 


your project. 
(Bl CAUsers\machame\AppData\Local\DGT\OmegaT Projects\TEST-CNECT-2014-80023\glossary\EN-PT.txt - Notepad + = ©) 
| File Edit “Search View Encoding Language Settings Macro Kun Textex Plugins: Window |? x 


oJHS . Bi sO Peciae 2 2 2S B1FTPA Sew gi* wee 
ee) Een-PT on 5} 


single 
single 
single 
single 
single 
single 
single 
single 
single 
single 
Single 
single 
single 
single 
single 


we Se ee ee 
mm ib > dm de De 

anh he ae > S 
= © ; ¢ < 


a w 


single 
single 
single 
single 
single 
Single 
single 


(Normal text file 


element semiconductor semicondutor elementar 
end-of-year bonus prémio tnico de fim de ano 

end wrench chave de bocas simples 

ended input -circuito de entrada de terra 

ended output -circuito de saida de terra 

engagement condu¢éo por um perfil 

engine qo-around -horreqo monomotor 

engine go-around remessa dos gas com um sé motor 

engine monorreator 

engined--de um s6 motor 

engined—monomotor 

entity authentification--autentica¢ado da entidade simples 
entity unidade fiscal 

entry accounting contabilidade por partidas simples 

entry compressor compressor de entrada tnica 

entry interference level-nivel de interferéncia procedente de uma sé fonte 
entry point —-balc&o tnico 

entry point -ponto de entrada tinico 

entry pump bomba de fluxo simples 

entry turbocompressor turbocompressor de simples fluxo 
entry visa visto de entrada tnica 

epi-layer FET FET de camada epitaxial Gnica e¢ jungédes difundidas 


ee eee 


length: 4607992 lines: 7952Ln:79513 Col: 32 Sel: 0/0 Dos\Windows UIF-3 INS 


Screenshot 99 — IATE extraction EN-PT with source and target terms only, viewed in Notepad++ (the default when 


you open a txt file) 


The entries from IATE will be displayed in the Glossary pane, in TransTips and in Auto-completion — Glossary 


entries. 


& lf — while translating — you change your mind and prefer not to have this IATE extraction mixed up with other 
project-specific glossaries you might have, just open the project folder (Ctri+Shift+F1), open the project 
\glossary folder and delete that file or drag and drop it to another location... in case you change your mind again 
and want to use it after all! 
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1.1.6. Glossary location 


You can have the glossaries you want to use in a different location in your computer — and not in the project \glossary 
folder — so that you can use the same writable glossary in all your projects without having to copy it every time you have 
a new project and also in order to be able to use the same writable glossary if you are working in more than one project 


at the same time. 


You just have to change — in the Project — Properties menu — the folder where you want OT to look at. 


In the Glossary Folder field, click on Browse and select the folder where you have the glossaries you want to use in 
that project and in the Writable Glossary File field, select the file you want to use as your writable glossary. 


™ Don't forget that — if the glossaries are in a folder outside the active project — there will be no automatic backups 
of your writable glossary. So don’t forget to do manual backups of your writable glossary to the H:drive! 


™ If you want to use this in a project while teleworking, check that there are no connection/speed problems. 


Default glossaries location 


Changed glossaries location to a general folder to be 


@ Ecit Propect 
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Source Mies Fekter Beow-ne 

Usersipnothane | pgOats \Lacal OCT\OmegaT_frojectsh DLS OS seurce 
Trandanos Wamary Folder Browee 


Ghemnary Folder 
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AOSSARE S| 
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Screenshot 100 — Properties menu to change the folder path where the glossaries are stored 


Ge It is also here that you can change the name of the writable glossary even if you don’t want to change its location. 


™ Don't forget that both the read-only glossaries and the writable glossary must be on a same folder. But you can 


have subfolders if you want. 


OT will store the new location and, every time you reopen that project, it will “remember” the location of the glossaries as 


you last defined. You can change the location again at any time. 


oe 
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1.1.7. Sharing glossaries 


If you are translating a project with other translators using TeamBase to share translated segments, you may also want 
to share glossaries. 


As you can change the glossaries location — either to another folder in your computer or to a server — you can change 
the Glossary Folder and Writable Glossary File fields in the Project — Properties menu to a common folder in a 
server accessible to all the translators you want to share glossaries with. 


™ If you want to use this in a project while teleworking, check that there are no connection/speed problems. 


While there is no problem in what read-only glossaries are concerned, with writable glossaries there may be conflicts. So 
it is recommended — to be on the safe side — that each translator has his/her own writable glossary by naming it 
whatever you and your colleagues want and defining it in the respective Writable Glossary File field in each of the local 
projects. 


This way: 


y The non-writable glossaries — IATE extraction and other personal/LD/Unit glossaries, if any — are accessed 
in read mode by all the translators 


y The writable glossary of each translator is defined as such in the Writable Glossary File field, but for the 
other translators it will be a read-only glossary. 


Also this way each translator is the “master” of his/her own glossary. As several entries for the same term will be 
displayed together, it is easy to spot alternatives/inconsistencies. 


File locations 


Source Files Folder: Browse 


C:\Users\machame\AppData\Local\DGT\OmegaT_Projects\TEST-CNECT-2014-80023\source\ 


Translation Memory Folder: Browse 


C:\Users\machame\AppData\Local\DGT\OmegaT_Projects\TEST-CNECT-2014-80023\tm\ 
Glossary Folder: 

P;\pt\DGT-OT-CNECT-SHARED-GLOSSARIES\ 

Writeable Glossary File: 


P:\pt\DGT-OT-CNECT-SHARED-GLOSSARIES\Glossary-Translator-1.bxt 


Dictionary Folder: Browse | 
C:\Users\machame\AppData\Local\DGT\OmegaT_Projects\TEST-CNECT-2014-80023\dictionary\ 
Translated Files Folder: Browse 


C:\Users\machame\AppData\Local\DGT\OmegaT_Projects\TEST-CNECT-2014-80023\target\ 


Screenshot 101 — Changing the location of the glossary(ies) to a server location and defining the glossary 
which is the writable glossary 
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|.2. Creating and editing glossaries 


The glossaries in OmegaT are very simple to manage and use. 


™ If you want to use a single — bilingual or multilingual — writable glossary for all your projects, don’t forget — before 
starting translating a new project — to change the glossary path as explained in Section 1.1.6. above. 


|.2.1. Creating a glossary entry 


As already explained, if you do not have a glossary in your project, OT will automatically create a new glossary for that 
project that, by default, will be saved in the \glossary subfolder of your project and named glossary. txt. 


aye Create Glossary Entry <j 


Enter 4 source term, 4 target term and 4 comment: 


Source term: Jeloud client 
Target term: Jeliente da nebulosa computacional 


MIM Talked to Paulo] ~ 


fi 
Glossary File: D:\vvorking 
Documents\OmegatT _Projects)GLOSS4RIOS i My-notebook-MI 


M.Ext 
cence! _| 


Screenshot 102 — Create a new entry in the writable glossary 


Comment: 


To create a new (or first) glossary entry, just highlight the source term you want in the Editor and press Ctrl+Shift+G (or 
select Create glossary entry in the Edit menu or use the icon). 


The highlighted text will be automatically inserted in the Source term field in the Create Glossary Entry window. 


After inserting the source term, you can also insert the target term by highlighting the target term in the Editor pane, 
positioning the cursor in the Target term field of the Create Glossary Entry window (or pressing Ctrl+Shift+G again). 
The target term will be copied too. Or you can just write the target term in the respective field. 


The same happens with the Comment field if, for instance, you want to copy a context. 


|.2.2. Modifying or deleting one or more glossary entries 


In OT, it is not possible to modify or delete an entry, i.e., you can create a new entry but you cannot edit nor delete an 

existing entry. 

Ge If, at first, the edit feature to change or delete entries seems to be missing in OT, on the long run | came to prefer it. In 
fact, it is faster to freely change all the entries you want in a text only file at the same time! 


If you want to do it, open the file that is stored in the \glossary subfolder of your project — or anywhere else — with the 
command Ctrl+Shift+F2 and process it in Notepad++ (the default application for txt files). 


File Edit Search View Encoding Language Settings Macro Run TextFX Plugins Window x 


oth & +» 62|e S| a> | aa! 7/3233) > ww FA om 
be] My notebook: MJM5 pa 5] 


cloud computing nebulosa computational MJM Preferred by some authors. 
Cloud Computing computa¢cao nuvem MIM - algumas ocorréncias 

Cloud computing computagao nuvem MIM some references on the Internet 
cloud computing computac&So nas nuvens MIM = some references - very poetic 
cloud computing nuvem computacional MJM 


soft power poder pela persuasdo MIM- Alternativas: poder suave, poder 


persuasivo (Teresa), poder de persuas&do, poder brando, poder de 
influéncia, poder subtil, poder discreto. Poder pela persuasdo nao tem 


Screenshot 103 — Writable glossary open for editing in Notepad++ 
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You can also have that file open in Notepad++ while working with OmegaT and inserting terminology entries in Notepad ... 
as long as you respect the format (Source term<tab>target term<tab>3" field<Enter>). Or you can just add to it a list of 
terms in a batch operation. 


You can make changes to previous entries and do save (Ctrl+S) in Notepad++ and those changes will be immediately 
shown in the Glossary pane, in TransTips and in the Auto-completion — Glossary entries list. 


If you edit the glossary in Notepad++, it is advisable to have the display showing tabs and end of lines by selecting — in 
the View menu — the option Show Symbol — Show All Characters, as shown below, so that you don’t mess up the 3 
fields! 


| fie ait Seancn [ifinw) Eecosng Lanquige Setiegs Maco Run TextX Puges Wincow ? x 
oh Always on Top TED RA AEG R A? wz 
Toggie Full Screen Moce Fil 


Nomad text {Be CovWindom: UTF-8wio BOM INS 
fel enti Post-it 2 pero ah 

Show Symnbot ' Show White Space and TAS 
Zoom , Stow End of Une 
Move/Oone Cariet Document +) Show All Characters 
he Shem Fedent Guide 

Vv Word wrap Show Wrag Symbol 
Focus on Another View Le ae 


Screenshot 104 — Notepad++ View options 


You can select entries to send to other translators/your reviser/LD terminologist as you can easily copy the entries you want 
to give/send them or that you want to include in the Tradesk Note. 


Just copy/paste those lines (or the whole glossary) into another application — an email, Word, Excel, Tradesk Note — and if 
you want convert the text to a table to make it more “presentable”. 


ITU Consthution = Constituiceo da UIT 
MJM - Link para a Constituigao em PT; 
TEx pansy Geanlvalya wei hitp:lhwwrw d.ue ptiChCBB/OUUITIUIT.1995-10.A.85-Convencan.him 
3-95 -ConeeDCAL pemer = 0 
my mi WJM - Nameresbo do articulade na Constitulcge da ITU: bis (A), ter (B}, quater (C}, 
penter (D}. Constituigao EN: 
http:teww uintienhistory/PagesConsStutionAndConvention.aspx, Constituigao 
PT (2004): http:!iwwrw fid.uc. pUCUCEEOMUITIUIT.1995-10-4.95-Convencao.htm 


langeige Seteg M. 


treeaes 


4BBi Pcia’s)* (GR AVETBA SSR aA? = 


He (th Seen View 
oS oot 


Screenshot 105 — Writable glossary open while editing 


1.3. 
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Glossary entries in the Glossary pane ae 


In DGT-OT, the display of entries in the Glossary pane has been improved. 


The Glossary pane now displays the entries in the following way: 


y 
y 
y 


The entries are displayed with source term in blue, target term in green and comment, if any, in black. 
The entries from the writable (priority) glossary are displayed first by alphabetic order and in bold. 


The entries from other glossaries are displayed after the entries from the writable glossary and also by 
alphabetical order. 


Entries can be highlighted and you can use copy/paste or drag/drop to copy them to the segment open in the 
Editor. 


Project Edit GoTo View Tools Options Help 


Bem eAldaAvY|{Telée?G 


22 ea oO 


Glossary — Oo 


/environmental impact eZ 
1. repercussG6es ambientais E | 
MJM - variant - test 1 
2. efeitos no meio ambiente 
3. impacte ambiental 
| 4. impacto ambiental 
|environmental impact assessment 
| 1. aferigao do impacto ambiental 
| MJM - variant - test 
2. AIA 
| 3. avaliag¢ao do impacto ambiental 
"on air" = "no ar" 
|A = <i>speed</i> 
|AIR 
1.AIR 
2. agao inovadora de reabilitagao 
3. programa de investigag¢ao em agricultura e agro-industria, incluindo a pesca 
4. reator por injegao de ar 
| Development 
| 1. DG VIII 
| 2. Desenvolvimento 
| Development Impact Assessment Framework = quadro de avaliagao do impacto no desenvolvimento 
|Efficiency = Eficiéncia ba 


Dictionary Comments Multiple Translations Notes FuzzyMatches Machine Translation 


Screenshot 106 — Entries in the Glossary pane: first from the writable glossary in bold (with comment starting 


with MJM) and then the entries from other glossaries (in this case IATE) by alphabetical order 


DGT-OMEGAT, ITS WIZARD WIZARD AND DGT’S CAT ENVIRONMENT — A TRANSLATOR’S GUIDE — MJM — June 2015 


1.4. Glossary entries in TransTips 


Translation Tips (TransTips) allows you to easily know what terms/strings have entries in the project glossaries. 


The terms/strings with a blue linear and bold underline mean that there is an entry in one of the glossaries (displayed in 
the Glossary pane) if you have this option activated and you have one or several glossaries in the \glossary folder of 
your project or in another folder you define. 


Project Edit GoTo View Tools Options Help 


BBSM DEAK AVIV TRV eIO RBS +O 


Editor - ELARG-2014-80031-03-01-PT-TRA-00.DOCK O Glossary .- @:o 
— 
| percocet on eneray efficiency, * | | Renewable energy sources = fontes de energia renovaveis = 


aoe and the |MJM - case - test 3 
fontes de energia renovaveis “energy sector, |renewable energy source = fonte de energia renovavel 
fonte de energia renovavel saving, energy ‘MUM - singular - test 2 
fontes renovaveis de energia | y and studying renewable energy sources 
FER ental impact of 1. fontes renovaveis de energia 
fontes de energia renovaveis | umption; : MJM - variant - test 1 
energia renovavel 2. FER 
~}|_3.fontes de energia renovaveis a 
ations Notes FuzzyMatches Machine Translation 


Cut 


Copy 211/1231 (3363/12097, 18258 
Paste 

Add glossary entry 

Set empty translation 


Remove translation 

Register identical Transiation 
Use as Default Transiation 
Create Alternative Transtation 
<t0/> 

<tl/> 


Screenshot 107 — Display of glossary entries with entries from the writable glossary displayed first. 


As shown in the screenshot above, a different entry is created if the same term (in this example renewable energy 
sources) is in upper or lower case or in the singular or plural, but they are all displayed both in the Glossary pane and 
in TransTips. 


The lemmatizer works — not always but frequently — and displays the several entries. 


To insert terms from TransTips, just right-click the mouse on the term you want and click on one of the options displayed 
in the dropdown list and it will be inserted in the target segment in the Editor pane at the place the cursor was when you 
left it or will replace the words in your target segment if you have highlighted them first. 


In the dropdown menu you can further select to Add a glossary entry to the project writable glossary and also have 
other features that are explained in other Sections of this Guide. 


If you do not want to see TransTips’ blue lines, in the Options — Glossary menu, uncheck the box Enable Transtips. 
The glossary pane will continue to display the entries in the glossary(ies) but they will not be marked with a blue linear 
underline in the open source segment. 


In this menu you can also select to have TransTips only for exact matches. 
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1.5. Glossary entries in Auto-completion @ 


One of the options of the new OT Auto-completion feature is that you can insert — in the target segment open in the 
Editor — terms in your glossaries in the target language field. 


To use the Glossary feature in Auto-Completion, press Ctrl+space and cycle through the options by repeatedly 
pressing it or pressing Ctrlt+Page Down/Page Up to select the desired option: Glossary entries. 


By default, the terms in the project glossaries will be displayed with the source term and target term by alphabetical 
order. 


® Part K on Auto-Completion for information on how to change the display. 


With the Up and Down arrows, you can select the term you want and press Enter. The selected term will be inserted at 
the position of the cursor in the target segment in the Editor. 


You can use this feature in several ways: 


1— __ Display all the terms in the glossaries for a particular segment: If you have a target segment empty (without 
any match from translation memories nor MT output), just press Ctrl+space (cycle through if necessary) to 
display all the entries in all the glossaries for that particular segment. 


Translation last modified by machame on 01-Apr-2015 at 08:45:11 
Only <'0/>25% of Europeans <!1/>can access <'2/>4G <(3/>in their hometowns, dropping to <!4/>4%<t5/> <(6/>for the rural 


population<t7/>. 
<segment 0041 ““TRA™ > 


CAN — anulacéo 

CAN — barramento CAN 

CAN — CAN 

CAN — cancelamento 

CAN —~ rece de zona do controlador 


FOR — FOT 4G access 
R — livre sobre vaaso ~ acesso a redes 4G 
—nneto. ines. 


nae caminhoc-dae-ferrn 
Glossary entries 
Press Ctrl+Page Down to go to Autotext entries suggestions. 
Press Ctrf+Page Up to return to Character table suggestions. 


Screenshot 108 — Display of all the entries in all the glossaries via the Auto-completion feature. 


2— Display only of terms beginning by a certain character or string: If you write a character (or more) and press 
Ctrl+space, only the entries beginning with that/those characters in the target language will be displayed in the 
dropdown menu. 

reforms can decrease prices of mobile services and boost productivity over time (estimated EU-wide GDP increase 


Spectrum 
<(0/>between 0.11% and<('/» <(2/>0.16% over 5 years<!0'>) 
<segment 0040 ""TRA™ > 


Estimate — Estimativ ° 
estate —- axtimnatey 
estimate —- estitnatiy @ orcomentat 


noeciram —» = espectro 
soectrian —- ©8 
Only <t0/25 foecrun — esoetro 
Population <! | Gossary entries 

s Press Cirt+Page Down to go to Autotext entries suggestions 
Apenas 25 % Press Ciri+Page Up te return to Character table suggestions, 


‘>In their hometowns, dropping to <!4/>4%<t5'> <t0/>for the rural 


Screenshot 109 — Display of the entries beginning by a certain character (in the target language) in all the 
glossaries via the Auto-completion feature. 


Spectrum reforms | can decrease prices of mobile services and boost productivity over time (eatimated EU-wide GOP increase 
<10/ between 0. 11% and<t1/> <(2/>0.16% over 5 years<!i/>) 

<segment 0040 “*TRA™ > 

Reformas do e: 
<ond prssrennco uae spect wer 


epectruen — # 


Only <(0/ 25% Hdeaad rapes fmintergrenpien soning in their hometowns, dropping to <(4/»4% «15!» <(6 >for the rural 


reiting = 6 


Screenshot 110 — Display of the entries for the same term but beginning by a string of characters (not one) 
in all the glossaries via the Auto-completion feature. 
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3— Continuous display of all the glossary entries as you type or move the cursor: When you press Ctri+space 
(displaying the Glossary entries), you can keep seeing the entries as you type — or move the cursor — until you 
choose to insert a term from the Auto-completion list by pressing Enter. 


@ when you select an entry from the Auto-completion list, the drop-down menu will close down. Just do 
Ctrl+space again to continue having auto-suggestions from the project glossaries. 


If you have project-specific glossaries for complex technical projects, this auto-completion from glossaries feature can be 
very useful — both for translating and for revising — as it can save you a lot of time in terminology search. 


The only drawback is that you don’t have the 3" field — sometimes with important information — displayed. But you can 
have the Glossary pane side-by-side displaying that additional information. 


| Project Edit GoTo View Tools Options Help 


B0¢n a¢| 0 § Es AVY AAQ4OH BF HwAO 
| Renewable energy sources = fontes de energia renovaveis 


 secgy Srucitiion and conteienatlons MJM - case - test 3 
<segment 17807“ TRA™ > renewable energy source = fonte de energia renovavel 
fontes de energia renovaveis e o impacto . 


MJM - singular - test 2 

renewable energy sources 

1. fontes renovaveis de energia 
ongu sve Ant - test 1 


ambiental do setor da energia, promovendo, 
assim, cL daeieleint sialon! a eficiéncia 


avaliagdo e atenuaca eneray source —= fontes de eneraia 

impact source — fontes de impacte : 
SUrenewable eneray sources — fontes renovaveis de eneraia 
renewable eneray sources — fontes de eneraia renovaveis 
renewable sources of eneray — fontes de eneraia renovaveis 
source of eneray — fontes de eneraia 
sources of eneray — fontes de eneraia 


pnergia renovaveis J 


ee 


211/1231 (3363/12097, 18258 


Press Cirl+Page Down to go to Autotext entries suggestions, 
Press Ctrl+Page Up to return to Character table suggestions. 


Screenshot 111 — Display of the entries in all the glossaries via the Auto-complete feature (without 3 
field) and in the Glossary pane (with 3" field, if any). 
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/.6. Auto-text entries o 


You can also use the Auto-text entries feature of Auto-Completion to insert text from a list of auto-text entries that you 
may which to use or to create. 


As with the Glossary entries feature of Auto-Completion, you can also have the Auto-text entries continuously 
displayed while you type ... and as long as you don't insert an auto-text entry from the list. 


When you do, the dropdown list disappears. Just press Ctrl+Space again and it will be displayed again. 


015 TT 


Translation last modified by machame on 30-Mar-2015 at 07:42:49 iss 
Why we need a Digital Single Market 

<segment 0001 **TRA** > 
Por que razao precisamos de um 


<end segment> 

Autotext entries 

Press Ctri+Page Down to go to Missing tags suggestions. 
315 million Europeans<t0/> <t1/>USt press Ctrl+Page Up to return to Glossary entries suggestions. 


Screenshot 112 — Display of auto-text entries via the Auto-complete feature. 


®) Section K.3. for information on how to add or delete entries in the Auto-text list. 
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|.7. Contributing to IATE a 


In DGT, translators can create entries directly in ATE or add comments to existing entries. 
It is this interactive capacity that gives IATE its name: Inter-Active Terminology for Europe! 


You can manage terminology issues combining, on one hand, the glossary and notes feature of OmegaT and on the 
other IATE. i.e., terminology in a particular project can be “digested” within OT and maybe at a certain point it will be 
worthwhile to make an IATE entry with that information. 


The translator can also “communicate” terminology problems/solutions to the Language Department Terminologist who 
will process those entries. 


Whatever the option chosen, DGT translators contribute to IATE ... either directly or indirectly. 


l.7.1. Golden rules for feeding IATE ... from my point of view 


1— __IATE is not a “shoe box” or personal file for collecting terminology. For that we have the glossary(ies) in OT. 

2— __IATE is about quality, not quantity. Its purpose is not to record/justify all the (dozens, hundreds of) terminological 
decisions a translator makes during the translation of a project, but to single out those that, according to the 
translator: 


y Might need further study or the opinion of others on the long run, 

y Might deserve clarification, 

yY Were the object of (some) research (neologisms or problematic terms) which would be of interest to 
share with others, 

y Could be proposed for harmonisation. 


3— An IATE entry must have added value (information) beyond what is obtainable from Euramis matches. 


4— — An lATE entry should normally contain, besides the source and target terms: 

a) Domain(s) (mandatory) 

b) Problem language (important for IATE) (mainly EN/FR for us) 

C) Context and/or definition (which, with Internet, is normally possible to find and use (copyright 
restrictions?) 

d) Note with explanation of problems it raised, eventually with variants (full and partial synonyms) 
examined/found in the target language and reasons for the choice made 

e) Related terms (if any) 

f) Confidentiality (Default: public) 

g) Synonyms, if any 


5— Translators should not be overburdened and validation must be a real added value 


Validation is essential, especially in cases where tentative translations are proposed and used, which require 
further study. 


This happens frequently with non-legislative documents (long reports, communications, working programmes, 
etc.) with a significant amount of new terminology (really new concepts or new terms in Community documents) 
for which there may not be time to make thorough consultations. The reward for the work done by the translator 
in adding these new entries to IATE will be that the next time he/she has the same term to translate, some 
expert may already have provided a better solution or validated the translator’s initial proposal. 


The situation is different with legislative proposals, for which this validation must be carried out before the 
translation of the document is released. 


6— IATE must satisfy, above all, the needs of the translators of the European Institutions and, more specifically, the 
needs of the institution feeding it (which may vary significantly): Legislative texts and other documents 
translated by the Commission are terminology sources for other institutions and the general public. However, for 
DGT translators (who produce the translations) what is of interest is the upstream process that leads to the 
choice/use of a particular term in the target language. 


oe 
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.7.2. Creating an IATE entry 
To access IATE’s full interface, just click on the IATE icon or press Ctrl+Shift+L. 


The easiest way is to create a new entry is to use the Basic interface by selecting it in Data Manipulation — Create 
Entry — Basic. 


C= = | Consuration | 
* 
— fn ne =~ 


Screenshot 113 — IATE basic interface to create new entries 


| Communication | | Preferences | | Help || Change Password =. 


You can use the information you collected in the OT glossary or in an OT note and copy/paste it in the relevant fields: 
q_ Domain — this is a mandatory field as IATE is a concept-oriented database. 
If in doubt, just choose — from the Level 1 dropdown menu — the general domain. In the validation process, 
the terminologist will check and correct it if necessary. 
q_ Source and target term — mandatory fields, of course 
q_ Other fields: provide as much information as you can. 


When you finish, just click on Save. The entry is automatically sent for validation. 


<® IATE Help for detailed information. 


' ~ oe te 
Public ~~ = Conficentiaticy 


“Public > < Cunfidentativy 


mo 


| 


} 
| 


Public TF < Confidentiativy 


7 = Confidentialicy 


= Sonhdenwatiey 


Ce a | 


he « Sonhdentativy 


Screenshot 114 — IATE basic interface to create new entries 
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|.7.3. Adding Marks (comments) to an existing IATE entry 


You can also add comments to an existing entry by clicking on Add Marks at the bottom of the IATE window displaying a 
particular entry and: 

q_ Selecting the kind of problem from the dropdown list — normally form, content or merge/deletion 

q_ Selecting the relevant addressee(s) — normally your Language Department Terminologist, in my case Term PT 


and 
q_ Writing your comment 


<® IATE Help for detailed information. 


: - { Consattation | | Dats Manipulation | | Communication| | Melp} | Change Pasnward | e. 


ce Etcainrha pegs neten 4 x, manne 
‘acnologias da Informacdo @ da Comur lo 
acaba cof amas ag ] 


‘ Verse connolieds do Traade 
Funcionamanto pater lions at Th 170." 
lac orgs 1047708) mec tanaeesenoal ne 

(24. eee 12) 


Screenshot 115 — Adding comments to an existing IATE entry 


DGT-OMEGAT, ITS WIZARD WIZARD AND DGT’S CAT ENVIRONMENT — A TRANSLATOR’S GUIDE — MJM — June 2015 


— PART J— 
NOTES 


DGT-OMEGAT, ITS WIZARD WIZARD AND DGT’S CAT ENVIRONMENT — A TRANSLATOR’S GUIDE — MJM — June 2015 


J.1. Using notes @ 


You may want to write notes — in the Notes pane — about a particular segment while translating it. 


By default, a segment with notes is highlighted in BiB when that segment is not open. When it is open, the note is 
displayed in the Notes pane and the segment is not highlighted in pink. 


If you don’t want the notes to be displayed in pink, in the View menu untick the option Mark Segments with Notes 


If you want to delete a note, just delete all the text, including spaces, in the Notes pane. The note will be deleted and the 
segment will no longer be highlighted in pink. 


You now have 2 new DGT-specific shortcuts — Ctrl+Shift+A and Ctrl+Shift+B — to easily Go To the Next Note and 
the Previous Note, respectively, and opening the relevant segment for editing. 


<t)> The other <1! footnotes will not stay in the text (since they are internal instructions Goneary -€90 
only}. Muli-baneficiary General MGA = MCS geral para mulibeneficirios - 
<1) >Asvestantes'<1' >notas de pé de pagina nao se manterdo no texto (uma vez que se trata 4, Confirmar Teresa 
apenas de instrucbes intermas). 

Mone-beneficiary General MGA = MCS geral para beneficiario 
Translation last modified by machame on 12-Feb-2014 at 16:32.37 ah be 
Textin'<10!>grey<t! > indicates that text which figures in the Multi-beneficiary GeneralMGA 4 28 0) 
Is not applicable in the Mono-beneficlary General MGA. beneficiario uriccimonobeneficiario. Discuss it wth Teresa 


<segment 2787 “TRA” > 
Otextoa’<00 »cinzento'<*| >indicaque’o texto que figura no modelo geral de’ 
sesh idaahldn Gack cknn Some Ay oho A-Solase pop 
beneficiario unico. 
<end segment> . 
Screenshot 116 — Notes displayed marked with a pink background in the Editor pane when that segment is not 
open and when it is open, the note is displayed in the Notes pane. 


J.2. Generating a list of all the notes 


If you want to generate a list of all your notes, for example to give to, or discuss with, your reviser/terminologist a list of 
problematic terms/segments, press Ctrl+Shift+F6 or select the Write Notes to File script in the Tools — Scripting 
menu. 


Pret atthe an caeube do 
adds cates ebaeaascotmipeita enaia = 


Lone see pecrry ereaee parreced co pavemmes of 
See bataacee wt wet ont ex Aractor 21) or 


Petoede ete on edie 
ems 0s th ou tn Aside 9.4. Pertaeerrsesscarent eres és mem be dren rman nc ome fs 


RID li Meal OO U4 PT TRA OADOCK 1 
(ii is Fe i 


Pema oh manages Cle, 


ote 


RID D0lé Meta? 68 45 FT TRA 8 DOCK 


i 


oe tet — foe 4 poms Oahalee pete memento tas iowa che 
Pet aerate tars een. Sa ee 


ite eames rye net, pete pemmen ie WY den chneee ier 
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Screenshot 117 — List of notes (with source and target segment), by document in the project 


The list containing the notes, by document, is displayed as a table with 3 columns — notes, source and target segments 
— and automatically saved as a html file in a new subfolder (\script_out) automatically created in your project folder 
with the name of your project followed by “notes”. If you want you can copy/paste it to a Word or Excel file. 


™ Every time you run that script, OT will save a new file, with the same name, in the same folder and it will replace 
the previous one, if any. 


So, if you want to keep a list of notes, open the project folder and rename this file or move it to another location. 
Then you can run the script again without losing the previous one. 


J.3. Generating a list with some of the notes o 


You can also export only certain project notes into a list using another script available under Options — Scripting: 
Write Query Notes to File (Ctrl+Shift+F10). 


By default, if you precede some notes of the term <query> at the very beginning of the note, when you use this feature 
OT will export to the project \script_out subfolder only those notes, by document if it is a multi-document project. 


The list is displayed as an html table with 4 columns — source and target segments, query with your note and a column 
for reply. 


ata TRA-00.DOCX 


for comparable products not por produtos compariveis no 
compliant with the product conformes com o cademo de 
specification of the protected name, or|lespecificagdes do nome protegido, ov 


ELARG-2014-80031-02-01-PT-TRA-00.DOCX 


Screenshot 118 — List of some notes (with source and target segment), also with a column for replies, by 
document in the project 
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You can also have different sets of notes for different purposes. For instance, notes for the reviser, notes for the 
terminologist, notes to list the words that must be formatted in italics in the translated document in the native 
application... 


In the project used in the example below, there are many segments which have words that in PT must be in italics but 
which are not so in the source segment. As in OT it is not possible to add formatting in the target segment that does not 
exist in the source segment, it is now possible to generate a list with only a subset of notes concerning formatting. 


In this aaa notes — to formatting are preceded by <format> (instead of <query>). 


re ’ o-= ioc Litliavyv S|. 6 eS RS aT © 
tebe - FLARE 9014 G66?! of O22 fF THe 48.00Om -_ o Peeters V5 crendentine _— =a Qo 


Translation fast modified by machame on OS-Feb-201S at fa:91-St TySclos. places (lajes). ladriihos © Culras pegcas ceramicas de farinhnas 
Bricks, biccks, ies and offer ceramic Goods of siliceous siliciosas fosseis (por exemplo, kKieseigunr. tripolite. diatomite) ou de 
foest! meats (for example, kieseiguinr, tripotite or distomite) or terrae siiciosas semethantes 

of similar siliceous eartha 

<seyment T1182 “TRA = 

TYCloOs. Places (Injes). Iacriinos © outras pecas ceramicas cde 
farinheas ailiciosas foaeeie (por exemplo, Meseigunr. tripotite. 
distomite) Ou Ge terros siliciosas semeinantes <format> Kalice « kieseigunr 
end seqrment> 


meee — oo 


evo02 
oh 


Pr nteet eteeeree on tee 


Screenshot 119 — Notes to be exported by subject. In this case to generate a list of seamenils with words with 
italics in target which are not in italics in source 


To extract a list with these segments, in Tools — Scripting, highlight on the left column the name of the Write Query 
Notes to File script to be able to write in the def search field — which by default is = ‘<query>’ — the subset of notes 
you want to export and click on Run at the bottom left of the menu. 


If you run the script by clicking on the number of the script below (<10>), with the shortcut Ctrl+Shift+F10 or selecting it 
in the Tools menu, OT will continue to generate all the notes as described in the previous Section. 


& This way you can have the best of both worlds! 


WY BLAKE 5514 66691 Agreement Koes COFy FROG os 


2 seorm nants we Mien 10 )\Users\naehame\ADEUALALetONOGT \Omeget_/e)ecls\T EET -HLANS- F014 S931 -ATeSMent ROSIN CORY FRO ar 
SuUC\TEST ELANG 2014 @0031 Agresment Nesove COMY THOO. quence. Mimi 


“4° > c- <a~ | “a “0- 10- | aa 2 


Screenshot 120 — Exporting notes by subjects 


This is the list generated for the notes preceded by <format> 


LANG 2016 Ont) Oe AT PT THA OA Oe 


Screenshot 121 — Exported notes relating to formatting 


or 
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K.1. Auto-completion — General @ 


Auto-completion is a new OT feature which allows you to complete words from glossaries (in your project) or auto-text 
(your own list of abbreviations for instance), to add (special/foreign language) characters and also to manage tags by 
pressing Ctrl+Space and cycling through the options with Ctrl+Page Up/Down. 


@® Part L on Tags for information on inserting Missing tags from Auto-Completion. 


In Part | on Terminology was already explained how to use the Glossaries entries and Auto-text entries features of 
Auto-Completion. So, in this section are explained the customization options of these features and also how to use and 
customize the Character table. 


Glossary entries Auto-text entries 
the ITER activities carried out by Fusion for Energy is hereby | 
i <thi>, ; ; 
® conclusion of that Agreement projeto ITER re 7aclas bela emoresa COMurn FUSION or Eneray é 
s ITER + Reator Termonuclear Experimental Internacional 
isd poo [TER alas ala EmoresaComum usin rE, | 0d ef cf 
elebragao do referid: iva 
terebve ~ itera Autotert entries 


Press Crl+Page Down to go to Missing tags suggestions, 
Press Ctrl+Fage Down to goto Autotext entries suggestions Press Cir+Page Upto retun to Glossary entries suggestions, 
Press Ciri+Page Up to return to Character table suggestions. 


Missing tags Character table 
ITER activities carried out by Fusion for Energy is hereby appr 


nclusion of that Agreement<‘\)>, PSA 
CDE FIG MIT 1 KLiMIN oO 
: SiTiVViWXViziC di) 
Jo projeto | or Energy c ‘ figihiliy t 1 a ° 
fi s Viwixiy 2 { . 
‘a¢ao do referic tflel-lti#l [es] ale 
sion. a 
POG OUT iM Ses | 
Nissing tags SESE TE HEC ICSE Ie 
Press Ciri+Page Down to go to Character table suggestions. Character table 
- 5: Press Cirl+Page Down to go to Glossary entries suggestions. 
Press Ciri+Page Up to return to Autotext entries suggestions, ‘eas to return to Meng tags 


Screenshot 122 — Auto-completion features 
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K.2. Glossary entries configuration @ 


In the present section is explained how to customize the Glossary entries using the options available in the Options — 
Auto-completion — Glossary menu — Auto-completer Glossary View configuration submenu. 


® Section |.5 for detailed information on using entries from glossaries with Auto-completion. 


— . — 
G2 Auto-completer Glossary View configuration 2s 
~/| Display source terms 
@ Show source term first | Show target term first 
¥ Sort by source term alphabetically 
| Terms with multiple targets 
| Show longer target terms first 
| Sort target terms alphabetically 


| Follow capitalization of the typed text 


| oK || cancer | 


Screenshot 123 — Auto-completer Glossary View configuration 


The options are self-explaining and you can change these settings any time you want according to your preferences. 
Below are presented the most interesting options. 


SP Auto-corspieter Glossary View configuration 7 
wy] 
7) Dregkay source terms the Chair of the scientific advisory board of the General Assemb 
1D Shere sosorem teers free Shaws terget bere fest i] <segment 0503 MT > 
¥) tort by powrce term alphabetcalty J fe} | Presidente do 9 | we 
Oe Terri Wan reinple targets <end segment> assembly 2 contunto 
Sere Keeper Saraes eevee Set " board — conselho de administracdo 
j Wort target terms sipnadeticalty arte 
; Glossary ent 
Toow enpitnlirnhes of the typed text Press Cirl+Page Down to go to Autotext entries suggestions. 
an (c) Press Ctrl+Page Up to return to Character table suggestions. 
we Cc 


~~ 


Screenshot 124 — Default display 


{ 


GP Auto-completer Glossary View configuration 24 


7 Ceaploy source tern 


the Chair of the scientific advisory board of the General Assem| 


Ghow source term first @ Ghow target term first ' 
i 7 Sort by suurce term alphabetcally <segment 0503 MT > 
acini volik eriasioss arabe O Presidente do Conselho Ca! Jo. Cian! 
Show longer target torres firnt K <end segment> conjunto — assemb 


conseiho — 2 


ssembiy 
Sort target terines alphanesc atty conseiho de administracdo — board 


Glossary entries 
Press Ctrl+Page Down to go to Autotext entries suggestions. 
(c) Press Ctri+Page Up to return to Character table suggestions. 


Fotlow coptakzotion of the typed text 


Cad Lee |] 


Screenshot 125 — Display changed to Show target term first 


W Auto compicter Glossary View configuration 


the Chair of the scientific advisory board of the General Assembly to be esta 
<segment 0503 MT > 
Teetines wrth rruttipte Lar yets | <end segment> 
" Show longer target terms frst 
/) Sort target terms alphabencalty 


Daplay source terme 


contunto 

iconsetho 

conselno de administracéo 

Glossary entries 

Press Ctrl+Page Down to go to Autotext entries suggestions. 


Follow copdakzatvion of the typed text i (c) Press Ctrl+Page Up to return to Character table suggestions. 


(con) (cancion) | Cc 


Screenshot 126 — Display with Display source term deactivated 
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K.3. Auto-text @ 


In the present section is explained how to customize it using the options available — in Options — Auto-completion 
— Glossary — Auto-text View configuration submenu. 
® Section |.6. for information on using the Auto-text feature. 


In this menu you can add entries to a new Auto-text file, add new entries to an existing file, remove entries and load 
(other) existing Auto-text files. 


( 42 Auto-text configuration 


} | Oteptay 
| Sort by tength | 
| 
iJ Gort alphabetically \~ 
" 
| 
| . | 
Entries 
} 
| 
horteut Pull text Comment Load... jue 
[MLN Mercade Uric Tea an 
| Ice ComissSo Eur... : 
i] ut Unide furepein 
| nbs 
el nity Add 
ndash = | al 
tI lrreceety Remove 


| [eK ] | Cancel = 


Screenshot 127 — Auto-text configuration menu 


To add entries to a new Auto-text file: 

1 — In Options — Auto-completion, select the Auto-text configuration submenu 

2 — Click on Add to open a new entry and fill in, at least, the Shortcut and Full text fields. The field Comments is optional. 
3 — Click on Save and choose the location where you want to save your Auto-text file. 


© | suggest you save it in the CONFIG-PERSONAL subfolder of the OmegaT_Projects folder. 
You can also choose to have the entries, if more than one, displayed sorted by length or alphabetical order. 


You can also edit the Auto-text file and write directly on it or copy (in a batch operation) a list of entries that you already 
have from another application or delete entries no longer needed. As it is just a text only file — UTF-8, you can easily 
change it. 


MUD + Mercado -Unico -Digital-Test4 
CE - Comissdo ‘Europela 
UE + Unido:Europeia + 


nbs + °° 


nbh ~ 
ndash-—°* 
mdash-+ 


Screenshot 128 - What an auto-text file looks like in Word with some abbreviations (namely for non-breaking 
space, non-breaking hyphen, n-dash and m-dash) 


a 


DGT-OMEGAT, ITS WIZARD WIZARD AND DGT’S CAT ENVIRONMENT — A TRANSLATOR’S GUIDE — MJM — June 2015 


K.4. Character table o 


You can add characters to the open segment in the Editor from the list available in Options — Auto-completion — 
Character table —-Character table auto-completer options submenu. 


as 
= 
be 
: , 


im oo le 


Ss\cmys 
= rd 
— 
“amen 


S 

bila Oi De 
biglorna est 
Paar 
om 


et 
r 


f 
Allow only sninne charnctare | Clear | 


| OK | Cancel 


Screenshot 129 — Character table auto-completer options menu 


To insert — at the position of the cursor in the open segment in the Editor — a character from this list, just: 
1— Access Auto-completion by pressing Ctrl+Space and cycling to Character table if necessary. 
2— _ Select the character/symbol you want and press Enter. 


<segment 0537 **TRA** > 


A 
Q 
a 
4 


XO} + ln) be! bo |S 


Cums ©} aiex jee) N ee 


Cami ¥laly [4 iISlaloAzA 


OP rie) <e [al|o|a\o 
Chm +1) #laeK|—|<|H 


i 
= 
A 
it 


Character table 
Press Ctrl+Page Down to go to Glossary entries suggestions. 
Press Ctrl+Page Up to return to Missing tags suggestions. 


Screenshot 130 — Display of Character table for insertion of a character/symbol 
in the segment open in the Editor. 
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Furthermore, you can also customize your character table with the characters/symbols you need more frequently. 


You can add characters to your personal list in Options —> Auto-complete —> Character table —-Character table 
auto-completer options submenu: 


1— _ Tick Customize character table 
2— _ Double-click on the character/symbol you want to add to your personal list 


3— Click OK 
§E Character table auto-completer options xs 
(| Customize character table 
lAdd characters to the custom table by double-clicking, or by 
highlighting and pressing the Insert key. 
Full table 
PiQIR |s jt UV Wx HY izietA i fio 
~_ja_ |b |c jd je ff |g th ji fj |k [lL |m jn jo [=> 
Ip iq |r |js ijt luv wy ly iz il |} iw 
, Fi, i. lf 1€ |7 I» |S [« |e 
‘ ’ “ ” = baad ™ e > ¥ 
ile je [Tis F ae ik Gt fet 
o |< [2 [3 q |: 1 0 |» [% |%[% é | LT 
kee eee 
Custom table 
@lée i Tt > F lo [| 
i 
, i] 
[| Allaw anly nique characters Clear 
| ok || Cancel | 


Screenshot 131 — Custom table with selected characters/symbols 


When you press Ctrl+Space, you will only see the characters/symbols you selected to the Custom table. 


es 


<segment 0542 **TRA*™ > 


Press Ctrl+Page Down to go to Glossary entries suggestions. 
Press Ctrl+Page Up to return to Missing tags suggestions. 


Screenshot 132 — Display of the Custom Character table for insertion of a character/symbol 
in the segment open in the Editor. 


If you want to have displayed the full list again, just click on Clear to delete all characters/symbols in your Custom table. 


— PART L— 
FORMATTING: TAGS 
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L.1. TagWipe a 


Tags are no longer a problem in DGT-OT with the in-house TagWipe (for Word documents) which eliminates almost all the 
useless tags that would otherwise clutter our documents. And tags in OT almost never block the creation of the target Office 
documents. 


L.2. Tag types 


By default, your document will be displayed with formatting codes (tags) that you can insert in the target segments in the 
Editor to have your document converted into its native application correctly formatted with all the bolds, italics, 
underlines, superscripts, subscripts and also the footnote numbers in the right position. 


In OmegaT, tags are not displayed as "what you see is what you get" (WYSIWYG). Tags are displayed in grey (Example: 
<t0/>) and do not indicate what type of formatting they represent, they are just numbered. 


The tags displayed are “inline tags” which are the only ones you will see and which concern formatting inside a segment. 


For all the formatting applied to the full segment (like a whole segment in italics, in a box, with style indents, etc.) you 
don’t see any tags. However, the final document will be properly formatted. 


Introduction nt 
Introduc&éo 


The Commission recently presented a framework for climate and energy policies in 
the period 2020 to 2030<(0/'. 

4 Comiss&4o apresentou recentemente um quadro sobre as politicas de clima e de energia 
no periodo de 2020 a 2030-10) 


This framework proposes ambitious targets for greenhouse gas emissions 

| reduction and renewable energy as part of the <')/-Union<\'>'s transition toa 

competitive tow carbon economy. 

Este quadro propSe objetivos ambicioscs em matéria de reducSo das emissSes de gases 

com efeito de estufa & de energias renovaveis como parte integrante da transi¢gao da 
Unido (> para uma economia hipecarbénica competitive. 


Translation last modified by machame on 28-Jul-2014 at 14:51:33 

Italgo promotes reduced energy dependency and more affordable energy for 
business and consumers via a well-functioning Internal market. 

<segment 0004 ““TRA™ > 

Promove também a reduc&o da dependéncia energética e proporciona energia a precos 

mais © acessiveis: para as empresas e os consumidores decorrente do bom funcionamento 

do mercado interno. 

<end seqmenm> . 


tnetomary Multifle tranuiatons Cormments Gonnary 


Project autraaved on 14.00 


Screenshot 133 — Display of single (in the second segment) and paired tags (in the third segment). 


There are single tags and paired tags which are not obvious as the numbering is continuous. To see which are the paired 
tags, you can use the Auto-completion feature and insert both tags at the same time. 


You also have to indicate with a tag, in the relevant segment, the place where the footnote number will be within a 
paragraph. However, the text of all the footnotes is displayed at the end of each of the documents for translation (much 
like end-notes) in the OT Editor. 


If you insert the footnote tag correctly, when the document is converted into its native format, the footnote number will be 
displayed in the correct paragraph in the correct position and the text of the footnote will be at the bottom of the right 
page. 

An easy way to see what inline tags really are — and how they affect the translated documents in the native 
applications — is to use the Remove Tags option as seen in the example in Figure 2 below. | translated a document with 
lots of formatting — bolds, bolds/italics, boxes, bullets, and style sheet formatting — with Remove Tags activated — and 
the formatting was converted correctly with the exception of: 


y The bold in "strategies on gender equality" and 
y The position of the number of footnote 45 within the paragraph. 
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suffers from a considerable los m pes 

increase in female researchers is lene then half the ental paseo of female PhD graduates 
and too few women are in leadership positions or involved in decision-makang In 2005 the 
Council set a goal for women to be in 25% of leading public sector r positions, but m 
2009 only 13% of the heads of hagher education sxtibutions were wo: integration of 
a gemder dimension into the design, evaluation and implementation hh is also still too 
muted 


The challenge is to improve on all these points to increase the quality and relevance of 
research The Commission is already committed to ensure 40% of the under-represented sex 
in all sts expert groups, panels end commuttees and will apply this particularly under Homzon 
2020. 


Create « tegal and policy errvironment and provide incentives to: 


remove legal and other bamiers to the recrustment, retention and career 


address gender imbalances in decision making processes 

strengthen the gender dimension in research programmes 
Engage in partnerships with funding agencies, research orgemusations end universities 
to foster cultural and mostitutional change on gender - charters, performance 
agreements, awards 


ea ptr ee car eR SPS NE PT Sp Sr 
involved in recruitment/career progression and in establishing end evaluating 
research programmes 


Research stakeholder organisations are invited to: 
. Implement institutional change relating to HR management, funding, decision. 
making and research programmes through Gender Equakty Plans which aim to: 
— Conduct impact assessment / audits of procedures and practices to identify 
gemeder bias 
— Implement innovative strategies to correct any bies 
— Set targets and monitor progress vie mdicators 
The Commission will: 


. Peeetathccregtie, yor Bre orbs ter al enero penaeree Sen eager HOS 
1 from inc ' 


SHE Figures 2209 
See Directzre DOB.S4EC 


ENGLISH ORIGINAL PORTUGUESE TRANSLATION — DOCUMENT GENHRATED WITHOUT 
INSERTING INLINE FORMATTING (TAGS) IN OMEGAT. 


2A. Igualdade ent éneros ¢ int =f “rspe ti enero na investigacao 


Apesar da existéncia dij estratégias de igualdade entre gineros § nivel nacional ¢ da UE, o 


setor da investigss So a ¢ utilracdo ineficiente de 
mulheres altamente qualificadss ° aumento aruaal do tamero de mulheres na mvecigacgio ¢ 
inferior a metade do mimero anual de doutorados do sexo feminino ¢ hé muito poucas 
mulheres em cargos de lideranca ou de decisio. Em 2005, o Conselho fixou 0 objetivo de um 
nivel de 25% de mulheres nos principms cargos de uvvestign¢ ao no setor publico, mas, em 
2009, apenas 13% dos chefes de entos de emsino supenor eram mulheres. A 
integracéo da dimensio do género n So, avaliacgdo ¢ implementagio da investigacao 
Cortana 4 ser amnida demasiado brute 


© desafio ¢ uma melhona em todos estes aspetos a fim de aumentar 4 qualidade ¢ 4 relevancis 
da investigasio. A Comissiio j4 se comprometeu a assegurar 40% do sexo sub-representado 
em todos os seus grupos de peritos, paintis ¢ comités ¢ eplicard este critério especialmente no 
&mbito do Programs-Quadro Honzonte 2020. 


Criar um ambiente juridico e politico ¢ proporcionar incentivos a fim de: 


Eliminar os obstéculos juridicos ¢ outros ao recrutamento ¢ progressio na 
carreira dos imvestigadores do sexo feminino ¢, 80 mesmo tempo, respeiter 
plenamente « legisiagto da UE em matéria de igualdade dos gé&eros"® 
Abordar a questo dos desequilibrios entre géneros nos processos de tomada de 
decsto 
Reforcar a dimensio de género nos programas de investigag$o 
Participar em parcenes com agencies de financiamento, orgeruzacSes de mvestigagao 
e universidades a fim de promover mudancas culturais ¢ instituctonais em maténa de 
g@émero - castes, acordos de desempenho, prémios 


Assegurar que, pelo menos, 40% do sexo sub-representado participe em comités 
envolvidos no recrutamento/progressio na carreira ¢ no estabelecimento ¢ avaliacio 


dos programas de investigag&o 
As organimodes de partes interessadas na investigagio silo convidadas a: 


. Implementaz mudangas imstitucionsis em maténa de gestdo de recursos humanos, 
financiamento, processo de tomada de decisho ¢ programas de investiga; So mediante 
planos de igualdede de géneros com o objetivo de: 

- Proceder 4 avaliacgio de impactofauditoria de procedimentos ¢ priticas a fim de 
identificar disenminacéo de género 
Implementar estraté gies inovadoras para cormgy eventiais discriminagdes 
Definir objetivos e acompanhar os progressos através de indicadores 


Ver « Diretrra 2006/4 CE] 


Figure 2 — Example of a document translated without inserting the formatting (inline tags) in DGT-OmegaT with Remove Tags activated 
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L.3. Tag insertion and deletion and strip tags @ 


There are several ways to insert tags: 


1) Inserting next missing tag: OmegaT now offers the possibility to insert tags sequentially at the position of 
the cursor. 


So, to insert the first tag in a segment, just place the cursor in the position you want the first tag to be and 
press Ctrl+T. The first tag will be automatically inserted. Repeat the process for the following tags 
sequentially. 


© For me, most of the times this is the easiest and quickest way to insert tags. 


2) Inserting paired tags using Auto-completion: Also a new Omegat feature. You can highlight the string 
you want to format in the open segment in the Editor, press Ctrl+space — cycle with Ctrlt+space if 
necessary — and in the Missing tags dropdown menu, select the pair (or the tag) you want. 


Option 2<(0/> (<ti/>tuning ranges<(2/>) 
<segment 0351 ““TRA™ > 


<end segr=19 
<to/> 


Press Cirl+Page Down to go to Character table suggestions. 
Press CtrlePage Up to return to Autotext entries suggestions. 


Screenshot 134 — Inserting (paired) tags using Auto-completion 


3) Choosing each tag in the dropdown menu: Position the cursor where you want to insert the tag and click 
on the mouse right button and a dropdown menu will appear in which you can choose the tag to insert by 
clicking on it. 


Option 2<(0/= («() =tuning ranges<(> >) 
| <segment O357 “"TRA™ > 

(Opc¢&So 2 (gamas de sintonizac&o) 

<end segment> 


Paste 


65 million Add glossary entry 
65 milhées 
| Set ermnpty translation 


| 44 million Remove transiation 
1 mines Register identical Transiation 
Insert Tag <t0/> 
Insert Tag <ti/> 
Insert Tag <t2/> 


| -18 thousand 
-15 milhares 
| 
tectonary Machine Transiabon tMuttigte 1 


Prabet autedaved ah SER Create Alternative Translation 
_——s 


Screenshot 135 - Insert tags via the dropdown menu 


4) Insert missing source tags: Press Ctri+Shift+T to insert all the source tags at the position of the cursor in 
the target segment. If there is only one or several in a row, this is an easy way to do it. 


5) Icon: Click on the respective icon (Insert source tags); this is another way to insert all the tags (if more 
than one) in the target segment at the position of the cursor. 


6) Drag&Drop or Copy/Paste: Select the tag(s) on the source segment and drag and drop it/them — or 
copy/paste them — in the target segment of the Editor pane. 


Also a new feature is that, if you have a target segment from Fuzzy Matches with lots of tags, but the wrong ones or in 
the wrong places, you can just press Ctrl+Shift+F5 — the Strip Tags script in the Tools menu — and it will clean all 
the tags in the target segment and you can insert the tags on a “clean” segment. 


i 


DGT-OMEGAT, ITS WIZARD WIZARD AND DGT’S CAT ENVIRONMENT — A TRANSLATOR’S GUIDE — MJM — June 2015 


L.4. Working without any tags — Remove tags 


If you really hate to see tags, you can work without any tags at all in DGT-OmegaT by activating Remove Tags in the 
Project —> Properties — Edit project menu and afterwards insert the formatting in the final document in its native 
application. 


You can deactivate Remove tags at any time — as the tags in the original are not displayed, but they have not been 
eliminated. 


In normal documents it will just take you a few minutes to check/insert the formatting in an Office document, and it can 
even be done by the Secretariat. 


™ | suggest that you don't switch from tags/no tags in the middle of the translation of a document/project, since 
tags affect segmentation and match rates and you will have previously translated segments that will become 
untranslated (orphans) as they will no longer have a 100% match in the project memory, due to the penalty for 
tags. 


However, if you really want to switch from no tags<> tags for some reason, those translated segments are not lost. 
They are in the project memory and will be displayed in the Fuzzy Matches pane as “orphan segments” when you 
open that segment. 


L.5. Validate tags o 


After finishing your translation, or at any time during the translation process, to check if OT detected any missing or 
wrong tags either: 


y Validate tags in your whole project: Pressing Ctrl+Shift+V or selecting Validate Tags in the Tools menu 
or clicking on the icon 16 (Validate tags), or 

y Validate tags in your current document: A new OT feature is that now you can press Ctrl+Shift+J 
(DGT-OT specific shortcut) or select Validate Tags in the Tools menu 


By clicking on the number of the segment in the window that is displayed on the left, OT will jump to the segment in 
question and open it in the Editor pane and you can correct it if needed. 


& | Entries with modified tags ies MEP el) 
ET Ts 
and regulating Switzerland’s participation in the ITER € que rege a participacdo da Sulca nas atividades do projeto 
activities carried out by Fusion for Energy is hereby ITER realizadas pela Empresa Comum Fusion for Energy Missing 
approved on behalf of the European Union, subject to the © aprovada, em nome da Unido Europeia, sob reserva da => Fix 
conclusion of that Agreementst0/> celebragao do referido acordo 
| 55 This Decision shall be published in the <t0/>Official Journal |A presente decisdo € publicada no Jomal Oficial da Unido Missing 
of the European Union. Europeia = Fix 
| 115 The proposal/nitiative relates to <t0/>an action redirected (A proposta/iniciativa refere-se a uma agdo reorientada para = Missing 
‘towards a new action uma nova acdo > Fix 
Medidas politicas que visam promover a cooperacdo entre a 
<t0/>Policy measure to encourage cooperation between VE ue Confederarao Suiga, tendo 4 cone & importéncia Ga 
the EU and <t1/>Euratom and Switzerland in view of the investigarao C&T para as Partes € a implementarao conpane 
importance of S&T research for the Parties <t2/> <t3/>on- ri apr esa Ge lnvesigarao de ate penn & Missin 
120' going joint<t4/> <t5/>implementation of research x spans COOpOER SO: are - oi. 
: ‘ realizadas no 4mbito do Programa-Quadro Horizonte 2020, do > Fix 
programmes of mutual interest, to cooperate and give : 
acoass 60 acindlies carried Gat Hoslzcn 2000. Euraions Programa Euratom ITER e do Desenvolvimento da Energia de 
ITER and the Development of Fusion for Energy (F4E) Fusao para a Producao de Energa F 
L 
F 


Screenshot 136 — Tags validation by document or for the whole project 


1] 
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L.6. Tags not to be ignored ! 


The Validate tags check is sometimes overcautious. There are tags that you may ignore... but others not! 


There are tags — apparently useless — for which it is better not even thinking of what they mean. Just insert them, in the 
same position, in the target segment or otherwise it may happen that your translated documents will not be generated. 


Always insert tags that are: 
y  Atthe beginning of the segment (probably due to poor segmentation). 


y Before and after a full stop in the middle of a segment (also probably a segmentation problem or poor 
original). 


Be also careful: 
y Not to repeat tags, i.e. not having more than one tag with the same number 
y Notto miss a tag at the beginning or the end of a word formatted in bold, italics, underline, for instance. 


These are paired tags and, if one is missing, it may happen that OT will not be able to generate the document 
in its native application. 


On the other hand, if you don’t insert both paired tags, the only effect is that the formatting will be lost in the 
translated document in the native application. 


y Not to miss a footnote tag as in that case the footnote — which you translated in OT — will not be in the 
created translated document in its native application. 


You can easily spot the footnote tags by hovering over them. 


Having regard to Decision No 676/2002/EC of the European Parliament and of the Council of 7 March 2002 on a regulatory framework for radio spectrum policy in the European Community 
(Radio Spectrum Decision)<!> », and in particular Article 443) thereof, 


Terdo am conta @ Deciséo n,° 676'2002/CE do Parlamento Europeu @ do Consetho. de 7 de marco ce 2002, relativa a um quacro requizmentzr pare a politics co espetro de radiotrequercias na 
Comunidede Europela (Decisic Espatro de Raciolrequéncias}=1), nomeadaments o artign 4° n.°3 


6 ftete </tr> cuct> curr curs woal= FoomnareRetenarc2" /»<fn7?e> <wrtoomnoReterence wods"]'/></mr> <i> <wt amispsce=" 


Screenshot 137 — Description of a footnote tag 


However, if you are working with Remove Tags activated, the footnotes will be in the right place at the bottom of the right 
page and the number of the footnote will be in the right paragraph (at the end of the paragraph) ... but maybe not in the 
right position. You will have to check and, if necessary, change it in the translated document in its native application. 


If you ignored other tags, the worst that can happen is having some formatting missing in the final document. 


If you create one or more translated documents and you cannot open them in their native application because there is 
an error, run Tag Validation again, correct the segments in which there are tags mismatches and create the translated 
documents again. 
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— PART M— 
SPELLCHECKER, LANGUAGE 
CHECKER AND QUALITY 
ASSURANCE 
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M.1. Spellchecker @ and Language Checker 


OmegaT uses the LibreOffice dictionaries for the spellchecking and language checker features. 


You can update your dictionaries with added or ignored words both in the Editor (for each segment) or with the 
Spellchecker feature (for the whole project or for a document). 


In DGT-OT, when you add learned or ignored words for the first time, 2 files are created in the CONFIG-PERSONAL 
folder: — ignored_words.txt and learned_words.txt — for each language pair. 


The update is recorded in those language-specific “dictionary” files which are used for all your projects. 


For translators of the Portuguese Department: 


The LibreOffice speller is quite good but FLIP is the one we are using in case of doubtful interpretation of the Spelling 
Reform 1990 or of double spelling. 


Therefore, as the LibreOffice dictionary (used by OT) does not behave in exactly the same way as the Word dictionary, 
perform the spellchecking also in Word before giving your translation to the reviser and/or sending it to the requester. 


M.1.1. Spellchecking and Language Checking in the Editor 


The words the LibreOffice dictionaries do not recognise are indicated with a red wavy line below them, just like in 
Office. 


If the speller indicates a misspelled or unknown word and if you want to see its suggestions, just right click over the 
word in question and choose the word in the dropdown menu that opens if you want to correct it with any of the 
suggestions displayed. 


You can also ignore or add a word to the dictionary by right-clicking on the word in the target segment open in the 
Editor and clicking on Ignore all or Add to Dictionary. 


Shaping the right environment for digital networks and services to flourish 
<segment 0034 ““TRA™ > 
Criagao de um ambiente propicio ao desenvolviiment* 27> “7707 ~ coniann sinitain 


desenvolvimento 
<end segment> ee 
desenvolvente 


desentendimento 


Strong European data protection rulestoboostt — ignore Ail 
Regras europeias de protecdo de dados solidas pare Add To Dictionary 


72% of Internet users <(0/>in Europe still worryt — ~ toom 
72 % dos internautas <!0/>na Europa mostram-se ait =~” Ihes se 
dados pessoais em linha ones 
Add glossary entry 
Rolling out fast broadband for all 


Disponibilizacdo generalizada da banda larga rapida «SPY Translation 


Remove translation 
Register Identical Translation 


Take-up of fast broadband is low: 
O nivel de implanta¢ao da banda larga rapida é baix: Use as Default Translation 
Dictionary Comments Create Alternative Translation 


Screenshot 138 — Example of a word misspelt/not recognised in the dictionary (in this case it is misspelled). 


1] 
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Grammar inconsistencies are indicated by blue wavy lines. Therefore, you do not have to explicitly call the speller and 
grammar checker. Just let the cursor a while over the word(s) with blue wavy underlining and the grammar suggestion 
will appear as a message in the target language of your project. 


references to the name of the Member State in which the wine, spirit drink and aromatised wine originates or other names to 
indicate the Member State, 

<segment 10516 ““TRA™ > 

‘95 termos que se refiram ao Estado MMembro de que o vinho, a bebida espirituosa € o vinho aromatizado sdo originarios ou outros termos 


que designem © Estado-Membro, Esta palavra é hifenizada 
<end segment> 


Screenshot 139 — Example of a grammar suggestion given in the target language. In this case, it calls the 
attention to the fact that this word should be hyphenated. 


In the Options menu, you can deactivate the highlight of unknown words or grammar suggestion if you don’t want to 
see it in each open segment by unticking the option Language Checker. 


M.1.2. Spellchecking for the whole project 


Furthermore, now you can check the spelling in all the documents of your project — with Ctrl+Shift+F7 or selecting 
that option in the Tools menu — and update your dictionaries of ignored or learned words. 


You also have some other options, the most interesting being the possibility to use the project glossary(ies). 


You can click on Ignore or Learn to add it to the relevant dictionaries or click on the number on the left column to 
correct a mistake in a particular segment. 


™ Don't forget to validate the segments you change with Return. 


Target 
o i 
PQ 
EUR (Ignore Learn 
biorrefinarias Ignore _}} Learn 
RTD (Ignore Learn 
biorrefinarias Ignore Learn 
Nanociéncias | Ignore Learn 
BIOCORE \_Ignore Learn 
EUROBIOREF (Ignore Learn 
SUPRABIO Ignore Learn 
EUR (Ignore Learn 
bioinddstrias Ignore | Learn 
biorrefinarias Ignore Learn 
biorrefinarias (Ignore __})(__Learn 
bioeconomicos {__Ignore Learn 
bioinddstrias Ignore Learn 
bioindustrias Ignore Learn —" 
ITc {Ignore Learn 
PPP Ignore Learn 
bioindustrias Ignore Learn 
ibioindustrias (___Ianore Learn a 
Check whole project Use glossary “| Ignore glossary case 
Replace escaped sequences with space p 
(\a, \b, \F, \n, \r, \t. \v) ¥| Replace custom tags with space 
=) Remove OmegaT tags which have al . a ral 
Theres eee een pe eee v¥| Remove mnemonics (&~) |¥| Remove defined fragments 
[ Refresh 


Screenshot 140 — Spellchecker for the whole project 
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M.2. Quality Assurance — Check Rules @ 


Using the script QA — Check Rules in the Tools menu (Ctrl+Shift+F3), you can check your documents for a series of 
possible errors: check for inconsistent numbers, shorter target, equal source and target, untranslated segments, etc. 


The options are self-explaining. You can check in the whole project or only in the current document and you can also 
check for spelling errors at the same time. 


By clicking on the segment number in the left column, OT will jump to that segment opening it in the Editor. 


™ Don't forget to validate the segments you change with Return. 


* 


Segment Rule 


1412 __)|Different punctuation 


Target 
A proposta/iniciativa requer uma reprogramacao ... 


Source 


Proposal/initiative will entail reprogramming of th... 


(1415 __ Different punctuation 


Explicitar as necessidades, especificando as rubric... 


Explain what is required, specifying the headings ... 


1482___)|Different punctuation 


JO L 298 de 26.10.2012, p. 84 


OJ L 298, 26.10.2012, p. 84. 


122 Different start case 


O presente requlamento limita-se ao minimo exigi... 


this Regulation confines itself to the minimum req... 


200 Different start case 


[Em qualquer litigio entre os membros relativo ao ... 


in any dispute between the Members which relate... 


281 Different start case 


A Uniao, representada pela Comissao, 


the Union, represented by the Commission, 


293 Different start case Conselho de Administracao; the Governing Board; 
294 Different start case Diretor Executivo; the Executive Director; 
295 Different start case Comité Cientifico; the Scientific Committee; 
{ 296 Different start case Grupo de Representantes dos Estados; the States Representatives Group; 
331 Different start case Aprovar as contas anuais; approve the annual accounts; 


335 Different start case 


Aprovar a lista de acdes selecionadas para financi... 


approve the list of actions selected for funding; 


361 Different start case 


Preparar e apresentar para adocao pelo Conselho... 


\prepare and submit for adoption to the Governing ... 


365 Different start case 


Assinar decis6es ou acordos individuais; 


sign individual agreements or decisions; 


387 Different start case 


Aconselhar sobre as prioridades cientificas a integ... 


advise on the scientific priorities to be addressed i... 


| 402 Different start case 


Atualizacao das orientacdes estratégicas; 


lupdating of strategic orientation; 


403 Different start case 


Ligacgdes com o Programa-Quadro Horizonte 2020; 


links to the Horizon 2020 Framework Programme; 


404 Different start case 


Planos de trabalho anuais; 


annual work plans; 


| 787 Different start case 


«Despesas administrativas» - Competitividade par... 


Competitiveness for growth and jobs ‘Administrati... 


59 Doubled blanks Por consequinte, as probabilidades de sucesso ser.../The pooling and coordination of research and dev... 
77 Doubled blanks no entanto, as a¢des iniciadas ao abrigo do Regul... |however actions initiated under Regulation (EC) N... 
1336___|Doubled blanks Contribuicdo para as misses e atividades da Em... |Contribution to the tasks and activities of the Fuel ... 
1338___}|Doubled blanks Contribuicdo para as misses e atividades da Em... |Contribution to the tasks and activities of the Fuel ... 
(1221 Doubled words XX 01 01 01 (na sede e nos gabinetes de represe... |XX 01 01 01 (Headquarters and Commission's Re... 
1222 __)|Doubled words XX 01 01 02 (nas delegacdes XX 01 01 02 (Delegations 
Check whole project ¥ | Check for leading whitespace | Check for shorter target 
¥| Check for segments with spelling errors | Check for trailing whitespace | Check for longer target 
| Check for doubled blanks | Check for equal source & target 
| Check for doubled words v) Check for untranslated segments 
| Check start case | Check number of tags 
¥ | Check punctuation at segment end | Check spaces around tags 
¥| Check for inconsistent numbers | Check sequence of tags 


Refresh 


Screenshot 141 — Quality Check for the whole project — or for a document in the project — with multiple options 
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M.3. Quality Assurance — ApSIC XBench 


DGT translators can also use the free-ware version of XBench (version 2.9) which is installed in our computers. 


For DGT translators, ApSIC Xbench is interesting mainly for its Quality Assurance (QA) features: 
y Find untranslated segments 
Find segments with the same source text and different target text 
Find segments with the same target text and different source text 
Find segments whose target text matches the source text (potentially untranslated text) 
Find tag mismatches 
Find number mismatches 
Find double blanks 
Find repeated words 
Find terminology mismatches against a list of key terms 
Execute user-defined checklists. 
Spell-check translations 
Checklists are user-defined searches that you can run in batch against your ongoing translation. 
For example, with check lists you can search for banned words or typical translator pitfalls. 


i i Ci, Ci, Ci, Ci, Ci, Ci, Ca, Gi. 


Although OmegaT QA already performs a substantial number of these quality checks, as you can see from the list 
above there are still some missing (in bold). 


So, if you want to do an in-depth quality check — particularly for large projects with several translators — it may be 
worthwhile to use XBench. 


® See the XBench Guide in XBench Help for detailed information. 
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— PART N— 
REVISION WITH DGT-OMEGAT 
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N.1. General 


For the revisers who don’t revise in the CAT tool, nothing changes. The generated translated documents can be used as 
usual, either printed or in electronic format. 


For projects in which the reviser wants to revise using DGT-OT, it is possible but with some manipulations as there is no 
automated workflow for the moment. 


In Section A.9 — Revising in DGT-OT in a nutshell ! — is described the simplest workflow for revision in the standard — 
and most frequent — situation in which a (single or multi-document) project is translated by 1 translator and revised by 1 
reviser and the whole project is sent for revision at the same time. 


That may be enough for all (the majority of) your projects... and in that case you don’t even need to read 
this section! 


In Section D.1.21. — Preparing a project for revision in DGT-OT — is further explained how the translator can prepare a 
project to be sent to the reviser, both with all the documents in the project or with only a part of them. 


In this Section is given a global and detailed explanation of DGT-OT features which are relevant for the revision process 
so that you can deal with different situations choosing the more adequate approach to your particular project and work 
method. 


Preparing projects for revision is, in fact, just a matter of moving/deleting and renaming some memory files. 


Both the translator and the reviser will be working in DGT-OT in the same “translation” mode with only a few small 
differences and they can communicate between them via: 


a) The Notes that can be added by either of them to particular segments. 
b) The project writable glossary, if any. 
In the subsections below are presented: 
y The things to keep in mind so that you can choose the most adequate method for a particular situation. 
y Workflows to cover other frequent (or less frequent) situations in our complex work environment. 


But of course you are free to do it any other way that suits you. OmegaT Is very flexible and simple to manage and maybe 
you will find a better way adapted to your particular project and context! 


The same workflow can be used, mutatis mutantis, for the revision of freelance translations, if tmx files are provided by the 
freelance translators. 


Take into consideration that, in OT, there are no track-changes in the (open or closed) target segment(s) displayed in the 
Editor. There are only track-changes in the segments displayed in the Fuzzy Matches pane which, in the revision stage, 
are from the “draft” translation that is being revised (displayed first for the same match rate) and possibly from other 
external memories. 


As it is possible to display track-changes in the target segments too, the translator who is checking the changes made by 
the reviser — to accept them or not — can see those track-changes in the Fuzzy Matches pane for the segment that is 
open in the Editor. 


Also take into consideration that, if you — as the translator — have the last word, you cannot accept some of the reviser’s 
changes and reject others in a particular segment. You can either accept all the changes made by the reviser or change 
manually the ones you don't accept. Or you can also reinsert your initial translation (from the Fuzzy Matches pane) and, if 
adequate, do the partial changes to your initial translation manually. 
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N.2. Revision process — Things to keep in mind 


To prepare a project for revision the translator can just copy the project used for translation and rename it; copy and 
rename the project memory and copy the project to a server location to be used by the reviser, as explained in Section A.9 
and this may satisfy most or even all of your needs. 


In these sections is explained how you can manage the revision workflow for more complex projects involving many 
documents/pages, several translators and sometimes several revisers with the translation and revision work being cone in 
several stages (or in “cascade”) and frequently with several new versions while the documents are being translated. 


For those projects, it may be worthwhile to select only some documents of a project for revision, shuffle some memories 
around and delete files which are of no interest to the reviser. 


N.2.1. Things to keep in mind concerning partial project memories 


If you, as a translator, want to send for revision just one or some of the documents in a multi-document project, you can 

choose to extract — from the global project memory — just the segments present in the document(s) you want to send for 

revision. 

& This may be important if you have documents in different translation stages — some ready for revision and others 
still in a “raw” state — and you don't want to give the reviser the global memory that may display, in the Fuzzy 
Matches pane, below 100% match segments in your project memory that are too “raw”. 


You can generate the individual document memory(ies) pressing Ctrl+Shift+F9 (or Tools — Create OmegaT Export). 


That/those memory(ies) are saved to the project \export-omegat folder and can be copied to the \tmlauto subfolder of the 
project for revision. 


N.2.2. Things to keep in mind concerning memories in the \tm\auto 
folder 


®) Section F.7. for more detailed information on Pre-Translation. 


1— _ The translator or the reviser can copy any number of memories of documents for revision to the tmlauto subfolder 
of the project at any time — even during the revision process — and the 100% match segments (including 
formatting) from those memories will be transferred to the revision project memory. 


Those segments will have the translated status and are displayed in the Editor pane with an orange background as 
auto-populated segments if, in the View menu, the option Mark Auto-populated Segments is activated (the 
default) and if, in the Option — Editing Behaviour options menu, the option Save Auto-populated Status is 
activated too (also the default). 


™ If you — either as reviser or as the translator who afterwards finalizes the project — don't like to see the orange 
background, you can deactivate the option Mark Auto-populated segments in the View menu and the orange 
background will not be displayed. 


You can reactivate it again later on, as long as you don't deactivate the Save Auto-populated Status. If you do, 
that action cannot be undone and the auto-populated segments can no longer be displayed with an orange 
background. 


: | 
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In the Fuzzy Matches pane will be displayed first the segment from the file(s) in the \tmlauto folder (the “draft” 
translation(s) in this case) and after that the segments from external memories, if any. 


As OmegaT only keeps the last modified translation of each segment in the project memory, this way the reviser 
and the translator can always check the segments revised against the initial translation displayed in the Fuzzy 
Matches pane. 


Lower than 100% matches will not be transferred to the project memory and will be displayed in the Fuzzy 
Matches pane like any other match, but will be displayed (for the same match rate) before other matches in the 
memories in the \tm folder, if any. 


This may happen if a new version of a particular document has arrived while the project is already being revised 
and the reviser updates the project with the new version(s) and simultaneously revises and introduces the changes 
of the new version, something which can happen in our work environment. 


If more than one memory is copied to the tm\auto subfolder and there are several identical source segments with 
different translations (but which are not defined as alternative translations), OmegaT will transfer to the project 
memory the translation of the last occurrence in the last memory (alphanumeric order) in the tm\auto subfolder. 


The other (conflicting) translations will be ignored. They will be displayed in the Fuzzy Matches pane when that 
particular segment is opened. 


This may happen in the case of projects with documents translated by 2 or more translators as in that case — even 
when working in share mode with TeamBase — inconsistencies may crop up in repeated segments in the global 
project which has been translated by different translators in separate OT projects. 


To give priority to one or more memories, it is just a matter of renaming the memories in the tm\auto subfolder by 
(re)numbering them in an alphanumeric inverse order so that, in case of different translations for an identical 
(non-unique) segment in the project for revision, the translated segment transferred to the project memory comes 
from the (alphanumerically) last tm\auto memory (from the translator) to which priority is to be given 


Example: If there are memories from translators A, B and C, to give priority to the segments coming from translator 
B in case of identical source segments, just rename the memories giving it the last number: 


1-TRANSLATOR-A; 2-TRANSLATOR-C; 3-TRANSLATOR-B 


and the translated segments from Translators A and C will be ignored and not transferred to the project memory. 
They will be displayed in the Fuzzy Matches pane like any other match, before segments in other memories in the 
\tm folder (if any). 


™ This is the opposite of what is done when organising external translation memories to be displayed in the Fuzzy 


be 
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Matches pane or to be used with Search! 


For large projects with several translators it may be important to establish the hierarchy of prevailing translation 
memories in the tmlauto folder if the documents (or part of a document) have some/many repeated segments 
(i.e. 100% identical including tags and — for alternative translations — segments with the same previous and 
next segments). 


If there are (many) segments with alternative translations so defined, OT will take the previous and next segments 
of each alternative translation into account and transfer to the project memory the last occurrence in the last 
memory (alphanumeric order) in the tm\auto subfolder which has the same previous and next segments. 


Keep in mind that, in DGT-Omegar«, "alternative translations" are document-independent by default, i.e. OT accepts 
that status for non-unique segments in all the documents of the project without “looking” at the number of the 
documents (which is recorded) and only “looking” at the previous and next segments. 


The reason for this default in DGT-Omegat is that frequently there are new versions of documents arriving during 
the translation or revision process and alternative translations from a previous version of a same document would 
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not be taken into account if the “alternative translations” were document-dependent as the number of the new 
version would be different. 


If you want to make it document dependent, just go to the Options — File Filters menu and unclick the option 
Ignore File Context when identifying segments with alternative translations. 


If the reviser had already started the revision and adds another memory to the \tm\auto folder, only the 100% 
match segments (including formatting/tags) for segments that are not in the project memory already (in the 
\omegat subfolder) — i.e. segments with the untranslated status — will be transferred to the project memory. 


The segments already in the project memory have prevalence and will not be changed. 
The only exception to this “rule” is if the enforce feature in OmegaT is used. “> Section F.9. 


Keep in mind that any changes the reviser makes will be auto-propagated to all non-unique (repeated) segments in 
all the documents of the project unless the status of one or more instances of non-unique segments is set as an 
"alternative translation". 


N.2.3. Things to keep in mind concerning segment display and 


1— 
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identification 


The reviser can open all the segments or just the ones he wants to change (View mode with Display Source 
Segments activated). 


Only the segments that the reviser opens and changes will be identified with his/her login and will no longer be 
displayed with an orange background. The others will remain with the login of the translator and with the orange 
background. 


At the end of his/her work, the reviser can check the changed segments just by scrolling in the Editor and looking 
at the segments that are no longer with an orange background. 


If the reviser wants to check the changes he has made with track-changes displayed, he can do it by ticking — in 
the Options — External TMX Options menu — the box View diff in target. 


When he opens a revised segment in the Editor, the track-changes will be displayed in the target segments (from 
the “draft” translation being revised) in the Fuzzy Matches pane. 


Don't forget — after finishing the revision — to untick that box if you are afterwards going to do translation (and not 
revision) work. 


If the reviser wants to quickly check the changes he has made — or if the translator wants to quickly check the 
changes made by the reviser — in large projects without having to scroll, they can filter those segments by doing a 
Search by: Regular Expressions (Partial segment) in the Memory only, writing a dot (“.”) in the field In 
translation and his login (respectively the reviser's or the translator's login) in the field Translator. 
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Screenshot 142 — Searching and Filtering — by the reviser- of the revised segments, by login of the reviser 


(in this example machame-REVISOR) 


Be 
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™ Don't forget to tick the box in those 2 fields (and to untick them when you no longer need it). 


Those segments are displayed in the Search window and the reviser or the translator can filter them for editing in 
the Editor or just click on the number of a particular segment to open it in the Editor. 


5 — The translator can open the segments changed by the reviser using the feature Go To Next/Previous Revised 
Segment (View menu) or with its shortcuts Cérl+Shift+X and Ctrl+Shift+Y. 


With this feature, what OT does is go to and open the next/previous segment that is identified with the login of 
another user, which will normally be the reviser’s login. 


The translator's “draft” translation is displayed in the Fuzzy Matches pane and the track changes are 
automatically displayed in the target (and not in the source) segments. 
a Don't forget to deactivate the Mark Revised Segments option if you are afterwards going to (continue to) 
translate a new project as this option will remain active, even in other projects, until you deactivate it. 
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Screenshot 143 — Viewing the changes made by the reviser with the Mark Revised Segments feature 
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Screenshot 144 — Searching and Filtering — by the translator (with Mark Revised Segments active) — of the 
revised segments, by login of the reviser (in this example machame-REVISOR). 


6— If the translator used pre-translation from previous reference documents during translation, the pre-translated 
segments that he opened (or not) but didn’t change will continue to have the login of the translator, if any, in the 
memory from which they were pre-translated. If the reviser doesn’t change those segments either, they will 
continue to have that same login. 


In that case, when the translator is checking the reviser’s changes, the Go To Next Revised Segment (i.e. 
segments which has a different login from the translator) will also open those segments with logins from other 
translators in the memories used for pre-translation. 


& In that case it is better to see the changes made by the reviser using Search and Filter as explained above. 
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N.3. Standard revision workflow of a complex project 
with 1 translator and 1 reviser 


If you have a project with a number of documents, new versions and which is not sent for revision at the same time, it 
may be worthwhile to use a different workflow of the one explained in Section A.9. 


Here is explained in more detail a standard workflow for complex projects. 


N.3.1. The translator prepares the project for revision 
In the original translation project (open in OT): 


1— Close any generated translated documents from that project that may be open in their native applications. 


2— Generate the translated document(s) by pressing Ctrl+D. If your reviser wants the documents printed too, print 
them. 


3— Copy the translated document(s) to Tradesk (using Tradesk Upload feature) so that they are available there to 
the reviser... and to everybody else. 


4— You can also export any notes you may have written with the command Ctrl+Shift+F6 (Tools — Write Notes 
to File script) which will be saved in the \script_output folder generated in your project folder when you use 
this feature. 


This way you can give the notes to the reviser on paper or in electronic format to discuss them. Anyhow, they 
will be displayed when the reviser opens a particular segment which has a note. 


5— Generate the project translation memory(ies) to be used for revision by pressing Ctrl+Shift+F9 (or Tools > 
Create OmegaT Export) and selecting the document(s) you want in the window that pops up. 


OT will generate individual memory files — with notes and alternative translations, if any (but without orphan 
segments) — to the project \export-omegat subfolder. 


6— Close Omegat. 


In Windows Explorer, with Ctrl+C and Ctrl+V, do a copy of your project (to that same OmegaT_Projects folder) and 
rename it (for instance {name-of-the-project}-FOR-REVISION). This way, your original project will remain intact and 
you can always reuse it if you want. 


In the copy of the project for revision: 
7—_ In Windows Explorer or via the DGT-OT Wizard, open the project folder. 
8— Copy the memories generated in the \export-omegat folder to the \tmlauto folder. 


You may want (it’s optional) to rename it/them (for instance, {name-of-the-document}-DRAFT) so that there is 
no doubt that these are memories before revision. 


9— Open the \omegat folder and delete all the memories there. Usually there will be the project_save.tmx file and 
several backups (example: project_save.tmx.201503121120.bak or project_save.tmx.bak). 


Leave the other files in this folder as they are. 


10 — You can also delete files — translation memories (retrievals or reference alignments) in the \tm folder, MT 
output in the mtl folder or any other files — that you consider of no interest to the reviser. 


11 — Copy this project to a location on a server that you have agreed with your reviser or which is the location used 
in your Unit/Language Department to exchange projects (or to a USB key). 


12 — Inform the reviser that the project is ready for revision and indicate the location. 


& 
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N.3.2. The reviser does the revision 


1— In Windows Explorer, copy to the OmegaT_Projects folder the project for revision prepared by the translator 
that was copied to the server (or copy it from the USB key provided). 


2— Open the project as usual via the DGT-OT Wizard to have automatic backups every 10 minutes. 


3— _ The process is the same as in translation mode. The only difference is that the documents are considered 
pre-translated and the segments are displayed in the Editor with an orange background to indicate that they 
were “auto-populated”. 


4— You can open all the segments or just the ones you want to change (View menu with Display Source 
Segments activated). 


Only the segments that you open and change will be identified with your login and will no longer be displayed 
with an orange background. The others will remain with the login of the translator and with the orange 
background. 


5— In the Fuzzy Matches pane, you will see first the segment from the files in the \tmlauto folder (the “draft” 
translation) and after them the segments from external memories, if any. 

& As OmegaT only keeps the last modified translation in the project memory, this way you can always recheck the 
segments you revised against the initial translation displayed in the Fuzzy Matches pane. 


6— You can see the Notes that the translator may have written and you can “communicate” with the translator 
writing your comment/answer or creating new notes. 


7— You can also add or change entries in the writable glossary, if any. 


8—  Atthe end you can check the segments you revised just by scrolling in the Editor and looking at the segments 
that are no longer with an orange background and opening them and changing them again if necessary. 


9— If you want to check the changes you have made with track-changes, you can — in Options — External TMX 
Options menu — tick the box View diff in target. Of course, you will have to open the segment to see the 
track-changes displayed in the target segment(s) in the Fuzzy Matches pane. 


If you do that, don’t forget — after finishing the revision — to change again this setting if you are afterwards 
going to do translation (and not revision) work. 


10 — If you also want to quickly check the changes you have made and not scroll in the Editor if it is a big project — 
just filter those segments by doing a Search by regular expressions, writing a dot (".”) in the field In translation 
and your login in the field Translator. 


™ Don't forget to tick the box in those 2 fields (and to untick them when you no longer need it). 


Those segments are displayed in the Search pane and you can filter them for editing in the Editor or just click 
on the number of a particular segment to open it in the Editor. 


11 — When you finish the revision, rename the project (for instance, {name-of-the-project-REVISED). 


12 — If you have the last word and you release the translation, just finalize your project sending translated documents 
to Tradesk and translated document memories to Euramis. 


13 — If the revised translation is to be finalized by the translator — or if you want to give the revised translation to the 
translator for information purposes only — copy the whole project to the same server location and inform the 
translator. 
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N.3.3. The translator finalizes the project 


If you are the translator and you have the last word: 
1— Copy to your computer the revised project from the server location. 
2— Open the project as usual via the DGT-OmegaT Wizard to have automatic backups. 


3— _ To easily see the segments that have been changed by the reviser, in the View menu activate Mark Revised 
Segments to have those segments marked with a red background. To see the changes made by the reviser, 
use the feature Go To Next/Previous Revised Segment (View menu) or the shortcuts Ctrl+Shift+X or 
Ctri+Shift+Y. 


OT goes to and opens the segment that is identified with the login of another user, in this case the reviser’s 
login. You can also do as explained in point 10 of the previous subsection to use the Search and Filter 
features. 


4— Accept the changes made by the reviser, reinsert your own translation from the Fuzzy Matches pane and/or do 
partial changes to any of them. 


5— Finalize your project following the normal procedure. 
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N.4. Revision of a project with several translators and 1 
reviser 


If there are several documents in a project that are translated by different translators and that will be revised by a 
single reviser, the process can be adapted from the one described above. 


This case applies when the documents translated by several translators are individual documents or parts of a large 
split document. 


In the case of a split document, if the revision is to be sent to the translators and finalized by them, then the originals 
must be split into several original documents which will be merged later in the native application. 


If the reviser has the last word and finalizes the whole project, the original document(s) can be split or not. 


In this case, probably there will be a “coordinator” — probably the reviser — of the project who creates a TeamBase 
memory to be shared by the translators and organises the project: specific reference translation memories, glossaries, 
etc. and prepares the individual projects for each translator from a “general” project with all the documents (or divided 
documents) that will be revised together. 


The coordinator will also do the Match Statistics per File in order to have global match statistics between all the 
documents with information on repetitions so as to be able to split the work in the best way. 


Each translator will work in an individual local project, possibly working in share mode via TeamBase. 


N.4.1. From the side of the translators 


The process can be done in several ways, but | think there is no point in having several projects prepared for revision. 
The simplest way is for each translator to: 


1 — Close any translated documents from that project that may be open. 
2 — Generate the translated document(s) by pressing Ctrl+D or Ctrl+Shift+D . 
3 — Copy the translated document(s) to Tradesk so that they are available there. 


4— Generate the project translation memory(ies) to be used for revision by pressing Ctri+Shift+F9 (or Tools — 
Create OmegaT Export) and selecting the document(s) in the window that pops up. 


OT will generate individual memory files — with alternative translations and notes (but without orphan segments) 
— to the project \export-omegat folder. 


5 — Rename that/those project memory(ies), for instance: RTD-2014-80083-partl-DRAFT-{your login} so that they 
can be easily distinguished from the ones of your fellow translators working in that project. 


6 — Copy that/those memory(ies) to a server location (or send it/them by email) and inform the reviser. 


N.4.2. From the side of the reviser 


The process is similar to the one described in Section N.3 above — the standard workflow of a complex project — with 
the exception that: 


1 — The reviser copies the export memory(ies) received from the translators to the tmlauto folder of a “global” project 
that s/he had already created as “coordinator” of that project or that was created by one of the translators and is 
made available in a server location and which has a descriptive name, for instance RTD-2014-70083-80084- 
Global. 


2 — If there are repeated segments in the several documents translated by different translators (detected with Match 
Statistics Per File), the reviser may give priority to one or more translators as explained in the Subsection N.2.2. 


3 — If the reviser has the last word, just follow the normal procedure to send translations to Tradesk and memories to 


Euramis. 
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If the revised translation is to be finalized by the translator(s) — or if the reviser wants to give the revised 
translation to the translator(s): 


4 — The reviser generates the revised translation memory(ies) to be sent to the translators by pressing Ctrl+Shift+F9 
(or Tools — Create OmegaT Export) 


5 — The reviser selects the document(s) to be sent to the translator(s) in the window that pops up. 


OT will generate individual memory files — with alternative translations and notes — to the project 
\export-omegat folder. 


7 — The reviser sends those individual memories to the translators for finalization or just for information purposes. 


N.4.3. From the side of the translator when finalizing the 
document(s) 


The process is the same as described in Section N.3 — standard workflow — with the exception that you (as the 
translator) should: 


1 — In Windows Explorer, do a copy of your original project — as in this case you did not prepare a project for revision 
— and rename it (for instance RTD-80083-REVISED) 


2 — Copy the memory(ies) you received from the reviser to the \tmlauto folder for pre-translation purposes. 


If you want (it’s optional) rename it/them (for instance, RTD-2014-80083-REVISED) so that there is no doubt that 
it/they is(are) memories after revision. 


3— Open the \omegat folder and delete all the memories there. Usually there will be a project_save.tmx file and 
several backups (example: project_save.tmx.201503121120.bak). Leave the other files as they are. 


4 — Copy the memory that you sent to the reviser — and that still is in the \export-omegat folder — to the \tm folder. 


& If you prefer, you can delete all the other reference memories in that folder so that you only see in the Fuzzy 
Matches pane the segments as revised by the reviser and your original translation. 


5 — If you have the last word, the rest of the process is the same as described in the standard workflow. 


If you just received the revised memory to be informed of the changes made, but it is the reviser that finalizes the 
document(s) and sends them to Tradesk (and the memory to Euramis), just follow the process if you want to see the 
changes that were made. 


N.4.4. The reviser receives other documents for revision while 
revision is already ongoing 


If the documents in a project are not sent to the reviser at the same time, the reviser can continue using the same 
project, just by updating it manually. 


The process, seen from the reviser side, can be: 
1 — The reviser starts the revision of a document in the project. 
2 — The reviser receives a new document for revision in the form of a translation memory sent by the translator. 


3— The reviser just copies that memory to the \tmlauto folder, giving it a priority or not, and the segments in that 
memory will auto-populate the “general” project memory (in the \omegat folder) but only for segments that are not 
yet in the memory ... and provided the original of the document the memory refers to is already (or is copied) to 
the \source folder in the project. 


Therefore, the segments already revised — if there are non-unique segments — will not be changed. 


4 — The reviser just resumes the revision within the workflow chosen for that particular project. 


= 
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N.5. Revision process in “cascade” 


The revision of the documents (or parts of documents) in a (large) project can be made in a cascading way, i.e. the 
reviser revises a first document, then sends the revised memory to the translator and afterwards receives a second 
document (already with the revision of the first document taken into account) and so on. There may be one or several 
translators involved. 


In this situation, the reviser can continue using the same project, just by updating it manually. 
1 — The reviser starts the revision of the first document. 


2 — The reviser revises the first document and the changes made are stored in the project memory (in the \omegat 
folder). 


3 — After finishing revising the first document, the reviser uses the command Ctrl+Shift+F9 (or Tools — Create 
OmegaT Export) to generate a memory of the revised document, which will be saved in the script_output 
folder. 


4 — The reviser sends this memory so generated — and identified as revised — to the translator who will check the 
reviser’s changes and take them into consideration in the next document that s/he sends to the reviser. 


5 — The reviser receives a new document for revision in the form of a translation memory sent by the translator. 


6 — The reviser just copies that memory to the \tmlauto folder, giving it a priority or not, and the segments in that 
memory will auto-populate the global project memory (in the \omegat folder) but only for segments that are not 
yet in the project memory ... and provided the original of the document the memory refers to has been copied to 
the \source folder in the project. 


™ Therefore, in case on non-unique (repeated) segments, the segments already revised will not be changed. 
7 — The reviser just resumes the revision work. 


8 — The process can be repeated if there are other documents to be revised in the same project. 
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— PART O— 
ATTRIBUTE CUSTOMIZATION 
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0.1. Preferences — Attributes in the Fuzzy Matches 
pane me 


OT displays the segments in the translation memories — called external TMX — of your project which have matches with 
the currently open segment in the Editor pane (MT output is displayed in a different window and not mixed with human 
translation). 


In DGT, we have the IT Unit which takes care of these technical aspects. However, you may want to change some of these 
defaults, namely the way the DGT attributes are displayed in the Fuzzy Matches pane identifying each segment and if you 
want — in the revision/finalizing phase — to see the track-changes in the target segments too. 


32 Extemal TMK Optons 
Sort fuzzy matches by: Tull text, induding tags and eumbers 
Please select how tags of nor-OmegaT TMXs should be etsplayed. 
Display tags 17 Use 108 for stendelone tags (e.g, <i>) 


7] View def in source View otf in target To see track changes in the target 
segments in the Fuzzy Matches pane 


Natth display template for the revision phase. 


|${2aLeSnorteath} ${instialCreationvate} 


Match; <siscore}/$inoStemScorel /${adiustedScore}4> - Source: <PiAtt:sReg. Serv.}-H{Att::Year}-#{Txt: Doc. No.}> ~ Translator: <@{Tat; :Transletoz} 


[(SE18)) -> ORT DIFF: Sidttt) Match display template which defines 


TH TRA: ${targetText} the way segments are identified in the 
Fuzzy Matches pane. See below. 


Temple wriles: fil) +) set | 


You can define here the attributes that will be displayed in 
the Fuzzy Matches pane. Just click on the icon to display 
the drop-down list of variables, select the one you want, 
position the cursor in the Match Display Template above 
and click on Insert. You can also write text to identify those 
variables. See Section 0.1.2 for a list of the variables 
available. 
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0.1.1. Attributes — Default and variant 


These are the default attributes so that you can see where the results come from if you organize your translation 
memories by subfolders. 


DEFAULT ATTRIBUTES: 


${fileShortPath} S{initialCreationDate} 
Match: <${score}/${noStemScore}/${adjustedScore}%> — Source: <@{Att::Req. Serv.}-@{Att:: Year}-@{Txt::Doc. 
No.}> — Translator: <@{Txt::Translator}> 
(${id}) —> ORI DIFF: ${diff} 
TM TRA: ${targetText} 


2-Norm-Memory\RTD-2013-80056-00-01-EN-ORI-00_EN-PT-RET.tmx (+8 more) 12/07/07 13:36 
Match: <100/100/83%> - Source: <TREN-2007-009910200> - Translator: <castrma> 
(6) -> ORI DIFF: The French Republic. 

TM TRA: Republica Francesa 


However, you may prefer to see less information or have it displayed in a different order. Here is an example of a 
minimalist display that you can easily copy/paste to the configuration window if you want. 


EXAMPLE OF AN ATTRIBUTES VARIANT if you want to see less information, for instance if you don’t 
organise memories by subfolders (example for the same fuzzy match segment). 
Just replace the text in the Match Display Template with copy/paste: 


Match: <${score}/${noStemScore}/${adjustedScore}%> — Source: <@{Att::Req. Serv.}-@{Att:: Year}-@{Txt::Doc. 
No.}> — <Date ${initialCreationDate} > — Translator: <@{Txt::Translator}> 
(${id}) —> ORI DIFF: ${diff} 


TM TRA: ${targetText} 


Match: <100/100/83%> - Source: <TREN-2007-009910200> - <Date 12/07/07 13:36 > - Translator: <castrma> 
(6) -> ORI DIFF: The French Republic. 
TM TRA: Republica Francesa 
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On the other hand, you might prefer to have more information, for example, see displayed in the Fuzzy Matches pane the 
notes that you have inserted in the segments you translate in the project. 


EXAMPLE OF AN ATTRIBUTE VARIANT if you want to see more information, for instance if you want 
to see the notes of translated segments in your project displayed in the Fuzzy Matches pane. 
Just replace the text in the Match Display Template with copy/paste: 


${preamble}) - ${creationDate} - Source: <@{Att::Req. Serv.}-@{Att::Year}-@{Txt::Doc. No.}> - Translator: 
<@{Txt::Translator}> - Created by: <${creation|d}> 

-> ORI: ${sourceText} 

-> TRA: ${targetText} 


3>) - 19/06/15 11:29 - Source: <-> - Translator: <> - Created by: <machame> 


-> ORI: on the approval of the Porsche AG coasting function as an innovative technology for reducing CO<t0/>2<t1/> emissions from passenger cars 
pursuant to Regulation (EC) No 443/2009 of the European Parllament and of the Council 


> we relative & sisneer da lpr amarcha em roda fvre» do Porsche AG oe uma ae Inovadora para reduzir as emissdes de CO<t0/>2<11/> 


<Note>: 5 em rode ire, cuales: os inercis, iment aoe inércia, ca com 0 motor oun 
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0.1.2. Attributes explained 


In this section is explained the way the variables work so that you can customize the attributes as you wish. 


Variables prefixed by ${...} are based on standard attributes or mark-ups from the TMX file, while those prefixed by 
@{...} are TMX "properties", which mean that they may exist in some files and not in other ones. 


memory. 


On the contrary @ffile}, @{id}, etc. are Omegat's internals, which should normally not exist in tmx from the \tm folder, 
unless you copy here the project_save.tmx from another project. 


Here is the list of variables available: 


The position of the segment in the matches list. Don't confuse with @{id} 
which is an OmegaT-internal feature. 


The note, if any, can be displayed together with the segment for segments 
in the project memory or in the external memories. 


| Source text of the match 


String showing the differences between the source and the match, i.e., 
new text in the segment in the Editor is displayed in the Fuzzy Matches 
pane in blue and text not present in the segment to translate is displayed 
in red with strikethrough. 


Hint: use this if the text you are translating has been updated. 


S{diffReversed} As its name indicates, it shows the differences in a reverse manner — 
between the match and the source —, i.e. new text in the segment in the 
Editor is displayed in the Fuzzy Matches pane in red with strikethrough and 
text not present in the segment to translate is displayed in blue. 


is targetTexth Target text of the match 
si Track changes showing the difference between the target segment open 


in the Editor and the translation memory. Used when — after the 
document(s) have been revised — the translator sees the changes 
introduced by the reviser to accept them or not. 


—z S{score} Match percentage taking into account tokenizers 
Percentage without numbers and tags. Default OmegaT match — number 
of matched words — with numerals and tags ignored — divided by the 
total word count 
} 


S{adjustedScore Percentage adjusted. OmegaT match, including numbers, tags 


Name of the tmx file, without path nor extension 
Full path of the tmx file 


Path of the tmx file starting from the root of \tm. 


Note: Important to keep it if you give priority and descriptive names to 
subfolders in the \tm folder. 
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S{initialCreationtd} Identical to ${creationld}, used by Omegat public version for historical 
reasons. 

S{initialCreationDate} Identical to ${creationld}, used by Omegat public version for historical 
reasons. 


S{changedid The author of the last version of this segment (tmx standard attribute; 
modified by OmegaT when you modify a segment) 

S{changedDate} The date of the last version of this segment (tmx standard attribute; 
modified by OmegaT when you modify a segment) 

Indicate that this match is fuzzy (currently only for translations from PO 
files with the #fuzzy mark) 


Only for alternative translations 
The source file this segment comes from (only for project memory 
segments, not for external memories). Omegat-internal 

Only for alternative translations 
For some file formats, such as Java Properties, the key used by 
software. OmegaT-internal. Don't confuse with ${id} which is related to 
the matches list. 

@{prev} 


Only for alternative translations 


Only for document files (docx, odf, ...) : the previous segment's text 


Only for alternative translations 


Only for document files (docx, odf, ...) : the next segment's text 


Only for alternative translations 
For some file formats, such as PO, the key used by software. OmegaT- 
internal. 


LJ CORNICE Banh Identification of the requesting DG 
LJ @{Att::Year Identification of the year in the document 


LJ Set Doc Tepe Type of document as defined in Euramis 
LJ ati staoe No: Number of the document 
LJ MES Person who translated or aligned the document 


@{Txt::Translator}> Translator who translated the document. For Eur-Lex documents there 
is never a translator name 
LJ @{Txt::TM Database} Euramis Database in which the translation memory is stored 
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0.2. Preferences — Attributes in Search menus se 


The Search feature in OT is very complete and allows to search by a series of criteria. 
In DGT, this feature has been substantially improved with new features very useful for translators. 


® Part G for detailed information on the Search options. 


In this section are only explained the options to customize the Attributes display in the Search window (your 
preferences). 


You can change them any way you want by clicking on Configure format and choosing the variables — in the Match 
Display Template — ordering them the way you prefer. You can also easily change it any time you want. 


_| Sama text in al fids AND DOR 


You can define here the information that will be displayed 
in the Search window. Just click on the icon to display the 
dropdown list of variables, select the one you want, 
position the cursor in the Match Display Template above 
and click on Insert. You can also write text to identify those 
variables. See Section 0.2.2. for the list of variables 
available. 


0.2.1. Attributes — Default 


DEFAULT ATTRIBUTES: 
${preamble}) — ${creationDate} — Source: <@{Att::Req. Serv.}-@{Att:: Year}-@{Txt::Doc. No.}> — Translator: 
<@{Txt::Translator}> — Created by: <${creation|d}> 

—> ORI: ${sourceText} 

—> TRA: ${targetText} 


TM:2-Nom-Memory\R TD-2013-80055-00-01-EN-ORI-O0_EN-PT-RET .tmx>) - 02/02/12 17:46 - Source: <-2011-32011D0719> - 
Translator: <> - Created by: <ribeica> 


-> ORI: The Kingdom of the Netherlands 
-> TRA: Reino dos Paises Baixos 
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0.2.2. Attributes explained 


Here is the list of variables available, which is similar, but not identical, to the variables available for the Attributes 
displayed in the Fuzzy Matches pane. 


This is the beginning of the message in OmegaT-public: 


- for a memory segment, this is the number 


- for a tmx file, this is the file name. 


The author of the first translation of this segment (tmx standard 
attribute) 
The date of the first translation of this segment (tmx standard 
attribute 


The note, if any, can be displayed together with the segment for 
segments in the project memory or in external memories. 


Name of the file, without the path but with the extension 
S{fileNameOnly} Name of the tmx file 


Full path of the tmx file 


Path of the tmx file name starting from the root of \tm. 


Note: Important to keep it if you give priority and descriptive 
names to subfolders in the \tm folder. 


RESIN 
there is never a translator name 


Euramis Database in which the translation memory is stored 
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— PART P— 
TROUBLESHOOTING 
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P.1. List of problems 


Here is a list of problems | already had to deal with together and ways to fix it ... hopefully. 


If you cannot solve it ... call for help! 


PROBLEM 


MAYBE 


TROUBLESHOOTING 


SEE SECTION 


The project has been 
created in the DGT-OT 
Wizard but OT gives 
an error and doesn’t 
open the project 


The project — which 
had been working 
normally in other 
sessions — doesn’t 
open 


Machine _ Translation 
and Translation 
Memories were not 
copied to the project 


Machine — Translation 
not copied to the 
project 


Target documents not 
created 


You deleted one or 
more files from the 
project by mistake 
and want to get them 
back 


You did a BIG mistake 
with Replace All and 
you want to do UNDO 


It is a problem with the 


_TagWipe 


It is a problem with the 
omegat.project file 


“Maybe you forgot to define 
language 


the correct 
combination 


The MT file was _ not 
generated because the 
language pair is not 
processed automatically or 
itis a multilingual project. 


You have one or more 
translated documents open 
in their native applications 


| 


| Create the project again in 


the DGT-OT Wizard with 
TagWipe unticked. 
If it is OK, work with 
_ Remove Tags activated ... 


_orask for Help. 


Create a new project with 
any document and copy 
the whole project which 
has a problem to the new 
project with the exception 
of the omegat.project file 
| 
If that is the problem, 
create the project again — 
_with the correct language 
combination — with a 


| different name. 


Request MT from the 
MT@EC service directly 


documents 


open and 
Create the Translated 
Documents again 


You can retrieve it from 
the H:drive where a copy 
of your project is kept or 
Restore it from the 
Recycle Bin 


| You can use a backup of 


the project memory in the 


| \omegat folder. 


Check that there are no 


See Section P.2. below 


See Section P.3. below 


See Section P.4. below 
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P.2. omegat.project file corrupted 


This is an essential file to open the project. If this file is corrupted, you will not be able to open the project and OT will 
display an error message. 


If this file gets corrupted (something which happens very, very rarely in my experience) don’t panic! 
A fast and practical way to solve this problem is to: 
1— Create a new project with any document using the DGT-OT Wizard. 
2— Delete everything from that project except the omegat. project file. 
3— raed everything from the old project to the new one... with the exception of the corrupted omegat.project file, 
of course! 


Hopefully, you will solve the problem painlessly! 


P.3. Retrieving a deleted file 


If you delete, by accident, any file in a OT project, you can use the backup in the H:drive to get it back. 


The backup that is done every 10 minutes only copies new files or modified files to the H:drive. It does not 
delete files — even if they have been deleted in your project. 


The projects’ backup is here: H:\CAT\OmegaT_Projects. 


Another way to retrieve deleted files is to Restore them from the Recycle Bin. 
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P.4. Using a project memory backup 


As OmegaT Undo/Redo feature only works within an open segment in the Editor, you will have to be careful when 
doing batch operation like Search/Replace or Search/Translate as, if you make a mistake, you cannot go back with 
Undo/Redo. 


So, suggest that you do a Save (Ctrl+S) before doing a batch operation. 


In extremis — if you made a really BIG mistake — you can use a previous backup of your project memory 
stored in the project \omegat subfolder. 


The segments you translate in your project are saved in the project_save.tmx and OT does automatic backups — 
every 3 minutes — of that memory, which are identified by the year, month, day and hour and have the extension .bak. 
Name 


bf files_order.txt 

bf ignored_words.txt 

4) last_entry.properties 

bf learned_words.txt 

4] project_save.tmx 
project_save.tmx.201503251338.bak 
project_save.tmx.201503251600.bak 
project_save.tmx.201503261559.bak 
project_save.tmx.201503270926.bak 
project_save.tmx.201503271029.bak 
project_save.tmx.201503271556.bak 
project_save.tmx.201503271706.bak 
project_save.tmx.201504011130.bak 
project savetmx2015040 ba 


project_save.tmx.201504071715.bak 


project_save.tmx.bak 


You can use a previous version of your project memory by: 

1— _ Renaming the project_save.tm«x file (for instance, project_save-old.tmx). 

2— Renaming (one of) the last backups before you carried out the problematic operation as project_save.tmx 
3— _ Clicking on Reload. 


OT will read the new project memory — “forgetting” the last changes made — and updates the segments displayed in 
the Editor. 


You may lose some work, but not a lot and ... hopefully ... you will solve the problem in a few seconds/minutes. 


i 
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— PartQ— 
List of shortcuts 
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If you want you can change the shortcuts to suit your preferences. This is not covered in this Guide. 


® the Appendix H of public OT Help for information on how to customize shortcuts. 


However, as DGT-OmegaT has DGT-specific shortcuts, here is a consolidated list with the public version and 
DGT-specific shortcuts. There is also the indication of the shortcuts that are still free. Not many! 


LIST OF SHORTCUTS ordered by shortcut 


Public and DGT-specific features 


Select Previous Match 


SS (2 (5 
I (20 (2 
[______] elect Fuzzy Match?_——SS—S—S—S—SCSSCSCSCSCSCSCSCSCid CT SCSC~* 
SS (2 (2 
(__ elect Fuzzy Matcha__——S—S—~—~CSCSCSCSCSCSCSCSCSCS~id a SCSC~C~*d 
[___]elect Fuzzy Match8__—SS—S—S—SCSCSCSCSCSCSCSCSCi CS SCSC~C~* 
[______]setectaSSSSCS™S—SSSSCCSC“‘*~*éd CA SSCSC~C~*S 
ire s—~=“‘“CNSCO#éC‘“(‘(#SCOC#“CO#“C’”SC#*C“C#*d‘NNSC*C*é‘<é‘®W 
SS a (2 
SS (2 ( 
[_____]Properties.. ____————S—S—SOOSCCC~CidECSC~* 
[_____]Search Project, ——SsS—S—S—SOSSCSCSCSCSCSCSC~id CE SSC~* 
SS (a (2 
a ( 
___]ProjectFies. _SsS—S—CSCSSCSCSCSCSCSCSCSCSC“‘i A SCSC~*d 
[________] Replace with Machine Translation ___————S—~idiura CCS 


Ctrl+N or Enter or 


Ctrl+O8=80sSdY 


a or 
Previous Segment Ctrl+Enter or 

Ctrl+Tab 
a Ctra 
i Ctrl+R 
Ctrl+S 

Ctrl+Shift+A 

Ctrl+Shift+B 2 
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Public and DGT-specific features 


Export Selection 
Create Current Translated Document 


= 


iw] 
ie} 
oO 
al 
=] 
a 
(0) 
= 


Create Glossary Entry 


PT iinsertsource CO t+ Shift + 
| = —_| Validate Tags for Current Document Ctrl+Shift+J Bi 


Search Directory Ctrl+Shift+K 


IATE 
Source of selected match 


OFOTOTOTOTO 
ees Fe -- Coal Mons 
ae eS bo ee Ee 8 
+Ut 0+ T+ 0+ T+ 
OINTNTNTONTW 
DIF 12 1721. 
ae es ee ee ee 
+U+ 0+ 0+ P+ T+ 
ZION mM~EOIO 


we 


RTL-LTR switch 


Replace with source 


jes) ‘Tl 

2 S 

x = 

store 

ae 

who |S 

3 =. 

< 2 

: 5 

< 
Ofolofofofololo 
Co oe ee Me ee en Me onal 
ee eS ee es es es es 
+U+ 0+ [+ 0+ 04 14 7 + 
OLHTHTHLHLalala 
SiS oyo>y5>i5>i->i-> 
ea ed ed ed ed a a 
ae oe oe Ee oe oe ne ey 
oOfviofujolzi=ic 


Register Identical Translation 


| ifinsert Missing SourceTags Ctrl Shift+T 
next ransiated Segment ders 
__]valdatetags____S—S—~—~—~—SCSCSCSCSCSCSCSCSCSCSCSd shinee 
ose Ltr Shifts 


Write Query Notes to File Ctrl+Shift+10 
Ctrl+Shift+11 
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Public and DGT-specific features FShorcut 


Shift+F3 
Shift+Ctrl+N 
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— ANNEX — 
MACHINE TRANSLATION: 
What Makes Moses Tick 
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Moses for Mere Mortals 


This Annex is an excerpt of the Moses for Mere Mortals Tutorial and is meant for absolute beginners. If you are already 
familiar with Statistical Machine Translation, skip it. 


It was meant to present, in extremely simplistic terms, a general idea of what Moses does so that the several stages of 
using Moses via Moses for Mere Mortals (MMM) could be better understood. 


In this overview are used illustrations in presentations by Philipp Koehn available on the Internet and screenshots from 
Moses for Mere Mortals. 


|. Statistical Machine Translation (SMT) 


SMT was a breakthrough in the field of Machine Translation with the publication of the seminal paper Statistical 
Phrase-Based Translation by Philipp Koehn, Franz Josef Och and Daniel Marcu in 2003!. 


In 2007, the open-source Moses SMT system was first made available within EuroMatrix, a research project co-funded 
by the European Commission. 


As SMT is language-independent — i.e. it is possible to train whatever language pair without the long and consuming 
work demanded by rule-based MT — it soon became the basis for services like Google or Bing. 


The breakthrough was that it builds on the work done by the IBM Labs in the 90's, evolving from word-based (Figure 1) 
to phrase-based models (Figure 2) ... and that made all the difference! 


Word-Based Models 


ry gid t glap t green witch 


n(?}elap) 


Mgry yot elap elap elgp the green witeh 
ws ¥ ~ x \ p-mull 
Mary pot ojap slap elap bt L the green witch 
\ \ | t (la|the) 
1 


Maria upa botefada veigde bruja 
aia/4) 


— 
ria no dabs a bofetada a 14 bruja verde 


; [foes Koigtt. 1997] 
e Translation process is decomposed into smatlor stops, 


each ts tied to words 


e Original models for statistical machine translation [Brown et al., 1993] 


Hipp Koon > utorral 


Figure 1 — Word-based Models 


: http://homepages.inf.ed.ac.uk/pkoehn/publications/phrase2003.pdf 


Ly 


DGT-OMEGAT, ITS WIZARD WIZARD AND DGT’S CAT ENVIRONMENT — A TRANSLATOR’S GUIDE — MJM — June 2015 


Phrase-Based Models 


[reraee [Fiza] [tee] [nach vaca] [aur rontenene| 
[Romer] [2] [es Air] [Fo cm contaranca [im Gonna 


. : ' - [tom Koehn et al, 2002, NAACL] 
e Foreign input is segmented in phrases 


— any sequence of words, not necessarily linguistically motivated 
e Each phrase is translated into English 


e Phrases are reordered 


Figure 2 — Phrase-based Models 


Very important also is that it became recently viable to have large corpora using a “by-product” of MT research: the 
Translation Memories. 


It is the training of these large corpora built from high-quality human translations — aligned as translation memories 
and later converted — that made possible Statistical Machine Translation. 


Therefore, there is in fact a synergy between human translation and machine translation. 


For professional translators, this can be a “virtuous circle”: machine translation is enriched with human translations 
(that are relevant for a particular domain) and, in turn, machine translation helps translators in their work. 


ll. What is a bilingual corpus 


Parallel data is a collection of sentences in two different languages, which is sentence-aligned, in that each sentence 
in one language is matched with its corresponding translated sentence in the other language. It is also known as a 
bitext. 


To train MT engines with Moses you need (a large amount of) data. 
Now with the widespread use of CAT tools, translated documents are usually stored as translation memory files. 


Translation memories are bilingual files which contain the source segments and the target human perfectly aligned 
translation in Translation Memory eXchange format (tmx) files (Figure 3). 


1] 
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Aligned document in tmx format 


SKET used to employ approximately 1 800 people and was the 
largest manufacturer of machinery and equipment in the new Lander 


|Osavaltioiden suurin koneita ja laitteita valmistava yritys. 


FI 
Silla oli noin 1 800 tyOntekijaa ja se oli edelleen Saksan uusien 


COMMISSION DECISION of 26 June 1997 concerning State aid in 
avour of SKET Schwermaschinenbau Magdeburg GmbH (SKET 
SMM), Saxony-Anhalt (Only the German text is authentic) (Text with 
EEA relevance) (97/765/EC) 


OMISSION PAATOS, tehty 26 paivana kesakuuta 1997, valtion 
esta SKET Schwermaschinenbau Magdeburg GmbH lle (SKET 
SMM), Sachsen-Anhalt (Ainoastaan saksankielinen teksti on 
odistusvoimainen) (ETA:n kannalta merkityksellinen teksti) 
97/765/EY) 


and in particular the first subparagraph of Article 93 (2) thereof, 


|bttaa huomioon Euroopan yhteis6n perustamissopimuksen ja 
|@rityisesti sen 93 artikian 2 kohdan ensimmaisen alakohdan, 


Having regard to the Agreement establishing the European Economic 
Area, and in particular Article 62 (1) (a) thereof, 


)ttaa huomioon Euroopan talousalueesta tendyn sopimuksen ja 
rityisesti sen 62 artiklan 1 kohdan 1 alakohdan, 


Having given notice in accordance with Article 93 to interested parties 
‘0 submit their comments, 


bn kehottanut niita, joita asia koskee, esittamaan maaraajassa 
huomautuksensa perustamissopimuksen 93 artikian maaraysten 


ukaisesti.seka katsoo seuraavaa: 


By letter dated 21 March 1995 (1) the Commission informed the 
German Government of its decision to initiate proceedings pursuant 
‘0 Article 93 (2) of the EC Treaty in respect of aid granted by the 
Treuhandanstalt (THA) and its successor organization, the 
Bundesanstalt fur vereinigungsbedingte Sonderaufgaben (BvS), to 
SKET Schwermaschinenbau Magdeburg GmbH (SKET SMM). 


omissio ilmoitti Saksan hallitukselle kirjeella 21 paivalta maaliskuuta 
995 (1) paatéksestaan aloittaa EY:n perustamissopimuksen 93 
rtiklan 2 kohdan mukainen menettely, joka koskee 
reuhandanstaltin, jallempana “THA’ ja sen seuraajan, 
3undesanstalt fur vereinigungsbedingte Sonderaufgabenin. 
\jempana “BvS’, SKET Schwermaschinenbau Magdeburg GmbH lle, 
Aljempana “SKET SMM’, my6ntamaa tukea 


he enterprise was located in the new German Land of Saxony- 
Anhalt and has since filed for bankruptcy 


|lvainittu yritys, joka on sittemmin joutunut konkurssiin, sijaitsee 


sachsen-Anhaltissa, yndessa Saksan uusista osavaltioista 


Its product range included rolling mills, wire-drawing mills, cranes, 
teel wire and cable-making machines, dressing and sizing 


5en tuotevalikoimaan kuuluivat valssaamot, langanvetolaitokset, 
hosturit, terasiangan, kaapelin ja terask6yden valmistuskoneet, 


hd 4 


Figure 3 — Display of a tmx file 


These tmx files can be merged and 
EXTRACT_TMX_CORPUS_1.043.EXE included in MMM. 


subsequently — split 


into 2 files with the application 


The resulting 2 files — one with the source segments and the other with the target segments — are perfectly aligned in 
text only (UTF-8) format (i.e., stripped of formatting). This is what a corpus looks like (Figure 4). 


CORPUS TO TRAIN THE 


COME2010) wre 

Sixth Repost om the Statietics on the Nusber of Animeis 
Used foe Experimental and other Seientitin Purposes tm the 
Hesper States OF the Ouropean Unien 

COM (2010) 

THTROOUCT ION 

The ebjeetive of thie eepert ie te present co the Coumess 
and che Europese Pariiement. in sccordance with Articie 26 
of Directive 06/409/EEC of 24 Movenber 1906 on the 
approximation of lers, regulations and administrative 
DeOvislons of the Menber States regarding the poowection 
Of animale weed for experimental and ooher setencific 
purposes, She statieticel data on the numer of animale 
uned for experimental and other scientific purposes tm the 
Neaber Staces of the £9. 

OD |, FSH, th. 92, 9088, 9. t 

The Litst two statistics! teputts drafted in acocedance 
with the provisions of the shows mentioned directive which 
were published ta iff) ana iv??, Covering date on 

HKper imental anteele colleeted in 1001 and 1906 
Cespectiveiy im the Bemmer States, ailoved oniy @ iimitea 
mount of stetietios! enslysie due te the aboenace of « 
consistent system of ceporting the data, 

COM (04) 105 temas 

In 1997 an agreement war reached between the Commiasion 
and the competent sutheritics of the Member States to 
oubeat date for future reports using = format of eight 
harmonised sables- 

The chied and fourth statietioal ceporte vublished is 2003 
and D003 covering date collected in 1999 and 2002 were 


TRANSLATION MODEL 


KOM (2OLO) + vyyE 

Kuudes* ket tomus + Kurcopas’ umsonin’ JAPenvRIt iC sen’ KoKketatin: 
Pe mULDE NC hetoel liek in: Core st iws in: meytentysen-elainten: 
LUKWNOOE RA: KOSKEV Leta Ci lestolstas 

ROM (2010)4 

SONDANTOE 

TAPOA- hee COMUNE BOA GAL TRTAAR BH UORTOLIE- }a-EUroopen 

per Llewentilie-EUin: asenvealt ioisea-mokeisiin: Sa-muihine 
tietecllisiin: tarhoitokeiin: Raytettavien elainten 
lukue@arcna kogkevat tt laatct, -kuten-rokeisiin: .a:muthin’ 
Rieteelliaiin: Ceskoitoks tia Meytettevien-eleinten:eucjetua: 
Hoshevien: Sasenvelt totden: lalrien, -asetusten- da- 
ballinnol listen méacaysten: lAbentemiseota dt: phivann: 
marraskuuta: 1906+ annetum: nacvoston-direktiivin:96/600/ETT: 
S6+mctikiassa‘eaetiyeecada.® 

EYUL- 1-388, +18, 22, 1004, +8. -9.8 

Ens imme inet -haksi+ ti lastolliata-ket Lomusta, > putha: 
lnaditetinsedellAcmeinitem-dsrekrt i ivin-samnndeten: 

MURS eweti,  JulKeist tin vuow ine: 1Fh4* pe° PPR. Nitece 

he Lewes Len whee lel: Skeenvalt tOhewe-wuonne- 1901+ beret ypa- 
TISLOPA’ DO- POLK ee hem -yuOREe: IVFE- RECALL Y a+b 1etoje: koe~ 
Slsimiets, ‘Niivoe: tehty: ttlertoensiyyri+ oli: kuitenkin 
HYVIN' TAPOLCECCU, ‘KOsKR KAytEetteviera-es-ollut: 
JOhdormuke Leta 6 LOCO One apoE TOL Ht LIAN eetelaan. @ 

KOM (94) -195- Lopullines.€ 

Vuonna: 1997 howhesto ja: Jeeceraitioiden toimiveltainet 
ViTAncemi set  PaAsiVEt sop imekseen: 21204, et ta myoheme ta 
HOT LOMURG LA VOECEN- ShOGOT TOMMETETC AISI In: kahdekaana: 
Vhdenmuketsena-touluaxona.# 

Vuonna~ 2003+ julkeiot tin kolmas-tilestellinen kectomus, 


Figure 4 — Bilingual corpus in UTF-8 format 


256 
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Moses will process those files, in what is called a ‘training’ and will create what is called ‘MT engines’. When the 
training is finished you will be able to start translating with it. 


There are several corpora freely available which allow building MT engines for all the EU official languages and also for 
some other languages like Chinese, Arabic. 


You can add your domain-specific data to these corpora if you have it. If you don’t have your documents aligned you 
can use open-source tools like hunalign = (http://mokk.bme.hu/resources/hunalign/) or LFAligner 


(http://sourceforge.net/projects/aligner/). 


Bilingual corpora can also be aligned at paragraph level, but it is generally thought that it is better to have them aligned 
at sentence (segment) level for MT purposes. 


ill. Tokens and n-grams 


Important concepts in SMT are tokens and n-gram units. 


Tokens are the basic unit in a machine translation process: tokens are a sequence of characters, such as words, 
punctuation or symbols, separated by a space. When the training is launched, the corpora are first tokenised. 


An n-gram is a subsequence of n number of (1, 2, 3, etc) items in a larger sequence. 


In a Language Model, n-grams are sequences of tokens. In MMM, you can choose the number of n-grams you want to 
use in a training. The default is 7, but you can use values between 3 and 9. Obviously, 7-gram Language Models take 
more time than 3-gram models and generate bigger files. 


In Phrase Tables and Reordering Tables, n-grams are sequences of pairs of source and target language tokens. 


In MMM, the default is 7, but you can choose values between 3 and 9. As for the Language Model, the higher the 
n-grams, the larger the Phrase Table will be, the more time it will take to generate the Phrase Table and the bigger the 
files will be. 


Depending on what you are using MT for, you may prefer speed to quality or vice-versa. 


IV. What is a monolingual corpus 


To train MT engines with Moses you also need monolingual data for certain parts of the training to generate: 
i) the Language Model, which tries to ensure the fluency of the machine translation output 


ii) the Recaser Model so that the machine translation output has the right case in words that have 
capital letters. 


You can use the target language file of the bilingual corpus already extracted (see Figure 5). But you can also add to it 
texts in the target language of the language pair you want to train. 


Monolingual data is much easier to get just by converting documents into the txt (UTF-8) format to be used by Moses 
to generate the language model. 
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KOM (2010) -yyyZ 
Kuudes: kecComus-Eurcopan: unionin: jasenvaltioissa: kokeisiin- 
Ja-tuthin-tiereellisiin-carkoituksiin: kaycertyjen-elaincen- 
lukumé4re64-koskevista-tilastoistas 
KON (2010) 4 
JORNDANTOR 
Tass: ker Ccwuksessa: esi tetaean:nevvostolie: ja: Buroopan: 
par lementille-£U:n-jasenvaitioissa:kokeisiin: ja-muihin- 
tieteeliisiin-tarkoituksiin-xAytettavien‘elainten: 
= iukveSarOa- koskevat + tilaztot, -kuten-kokeisiin-ja-mushin- 
provisions of the b va@iing the protection tieteellisiin-carkoituksiin:kAytettavien-elainten:sucjeluna- 
Of animals used for exp other scientific koskevien jaszenvalt iciden-lakien, -asetusten: ja: 
purposes, the statistical hallinnollisten-maaraysten- lanentémisesta-24-paivana: 
used for experimental and o f marraskuuta- 1984-annetun-neuvoston:direktiivin:86/605/ETY- 
Member States of the EU. 26-arctikiassa-edellycecaan.¢ 
OF L 356, 16.12.1906, p.t. LYVL+L +356, +106.12.1966,+5.+1.2 
Ena immbizet-kakai-+tilestolliata:-kertostuata, + jotkar 
lasdittiin-edella-mainitun-direktiivin: saannosten: 
muknisestt, -julkaintiin-vuonina: 1994+ja- 1999. -Niista- 
Ons immbinen-sisdlsi- jésenvaltioissa:yucnna: 1991-kerattyja- 
Ti4TOla Ja-IGlKimAi nen vuonna- 1996+ keracty34-tietoja:koe- 
elAimista. -Niias&-tehty-tilastoanalyysi-oli-kuitenkis- 
hyvin-tajoitettu, -koska-kaytetctavissaé-e1-ollut- 
Johdonsukaista:tietrojenraporcointijarjeste ima. = 
KON ($4) -i95-lopullines.@ 
Vuonna: 1997+ komissio+ja-jasenvaltioiden-toimivaltaiset + 
Viranomaiset + pAAsivAt-:sopimukseen:stita, -etta-myohempia- 
wettomuksia-varten-tiedot-toimitettaiesin: kahdeksana- 
hircd and fourth statistical reports published in 260: yhdenmukaisena-taulukkona.g 
id 2005 covering data collected in 1997 and 2002 vere Al) Vuonsa:2003- julkaistiin-kolmas-tilastollinen-kertomus, « 


Figure 5 — Monolingual corpus in UTF-8 format 


V. What is training a corpus 


When you have the corpora, these must be processed by MMM/Moses. The steps involved may differ depending on 
the options concerning language model, tuning, etc.. However, in general terms and just to have an idea, these are the 
steps in a training with the IRSTLM language model: 


1) Corpus preparation: 
a. Tokenisation: insertion of spaces between (e.g.) words and punctuation. 
b. Lowercase data. 
c. Cleaning: long sentences and empty sentences are removed as they can cause problems. 
d. Furthermore, some characters that have proven to give problems are eliminated or converted. 


2) Alanguage model (LM) is generated using one of the LM tools — IRSTLM (by default in MMM) or RANDLM: 


3) Word alignment: The training corpus — which is already aligned at segment level — is further aligned at 
subsegment level using a word aligner. 


In MMM, the word aligner is MGIZA++; 


4) Atranslation model (phrase table) is generated which contains phrase-to-phrase translations extracted from 
the training corpus; 


5) Other models are also generated like the Reordering and Recasing Models; 
6) Optionally, tuning can also be done to try to improve the quality of MT output; 
7) Optionally, it can also automatically score MT output with automatic metrics. 
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In MMM, are available the BLEU and NIST metrics. 


2 0006668 6696668860895 8 6880000806688 5555586 08080800 088000686008086800000600800800 


2e** Duration *¢¢: 


4 222222 SS SS SSS SSS SSS SSS SS SSS SE SE ES ESSE ES SS SS SSS SSS SSS SS SEES 


S Start time: day: 30/10/2014-tt(me:21:33:160 

6 stert tenguage model buttiding: day:30/10/14-ttme:21:33:16 

7 Stert recaser tratnting: day:30/10/14-time:21:41:31 

8 start corpus training: day: 30/10/14-time:21:43:11 

9 Start memory-mapping: day: 30/10/14-time:23:11:49 

10 Start tuntnge: 

11 Stert test: day: 30/10/14-time: 23:18:15 

i2 Stert scoring: day: 36/106/14-time: 23:21:28 

iS End time: day = 360/16/14-time:23:21:30 

14 sees eee Beene SR ERE ESE ERR REE EEE EEE EE EE ee eee 


is*** Languages*** 

Le ee ee ee ee ee 
17 Source Language: pt 

18 Target Language: en 


270 *** Training <«s-teps in fact executed *** : 
21 ese eee SS SSS SSS SSS SES SS SSS SES SSE SSS SSS SSS SS eee eee 
22 Laenguege model buttiding executed=yes 

23 Recaser traetning executedwyes 

24 Corpus tratning executedeyes 

25 Paraltel tratnting executed=yes 

2o6O First tratetng step= 

27 Lest tretning step«9 

28 Corpus menmmappting executed«yes 

29 Tuntng executed=no 

30 Training test executed«yes 

Bi scoring executedeyes 


Figure 6 — Example of the report of a training of the 200 000 Demo corpus (PT-EN) with the MMM defaults 
(without tuning) 


VI. What a Translation Model (Phrase Table) looks like 


The phrase table is a kind of bilingual dictionary with probabilities computed during the training process. The phrase 
table is what is used by the system to try to guarantee the correctness of the translation, i.e. that “the black cat” is 
translated in the target language by its equivalent (“le chat noir”) and not by something like “le chat jaune”. 


Just to have an idea, a 3.4 million segment corpus (with 64.4 million words (EN)), which is available in the MMM 
website, can generate a phrase table with about 200 000 entries ... with different degrees of reliability (probabilities as 
computed during the training). 


Depending on the choice made, the Phrase Table may have entries with up to 9 grams. In MMM, the default is 7. 


5192 actividades de investigacdéo « ||| the activittes of research and [jf t 6.196191 6.5 66505533 2.718 [|] O-1 1-2 2-3 Dea [PP tat 
5193 actividades de tnvestigacao |] activities of research ||] 2 0.154602 0.5 0.225131 2.718 [Jf] @-O 1-1 2-2 [Jf 221 

5194 actividades de investigacdo [|| the activities of research |[] 1 6.154862 0.5 0.0041159 2.718 ||} O-1 1-2 2-3 [J] 124 

5195S actividades de |p] activittes of ||| @.2 6.154862 6.5 0.225531 2.718 ||| O- ries2d 

5196 actividades de |]] the activittes of |[[ 1 6.154662 0.5 0.0641159 2.710 IIT 21th a2 

$197 actividades defintdas no artigo 1° ||| activittes as set out in article 1 [J] 1 4.00002S078 ft 6.00279051 2.718 ||] o 
5198 actividades definidas no artigo |{] activities as set out tn article ||| 1 6.60540043 1 6.00279051 2.718 |] | O-0 1-1 
5199 actividades definidas no ||| activitles as set out tn |[] 1 0,.00546945 1 0.00294 2.716 JJ] O-O 1-1 1-2 t-3 2-4 JEP 
4200 actividades definidas ||] activitles as set out |{] 1 0.0252204 1 2.728 [J] O-O 1-1 1-2 2-3 TEP 248 

$201 actividades no Anbito do presente ||| activities under thts ||] 0 002122585 1 0,0723819 2.718 [[| O-0 2-0 2-0 2<4 3-1 4-2 [J] 231 
5282 actividades no dnbito do [|] activitles under ||] 6.5 @.00240336 1 0.676598 2.710 ||] 0-0 1-@ 2-6 2-1 3+) [J] 211 


$203 actividades ordinartas ( seja antes ou apos ||| ordinary activities ( elther before or after ||] 1 0.0272125 1 O.00705041 2.718 ||] 1-0 O-1 2-2 3-3 4-4 5-5 6-6 [Jf Dod 
$204 actividades ordindrtas ( seja antes ou |} ordinary activittes ( etther before or ||] 1 6.0395817 1 0.00834295 2.719 ||| 1-0 O-1 2-2 3-3 4-4 SeS [Jf 22d 

5205 actividades ordinartas ( seja antes ||| ordinary acttvitles ( etther before ||] 1 6.0468384 t 6.00907248 2.716 ||] 1-0 G+ 2-2 3-3 4-4 [J] tad 

5206 actividades ordinarias ( seja I]] ordinary activities ( either []] 1 6.60702576 1 0.0542987 2.718 [|] 1-0 O-3 2-2 9-3 [JP 191 


5207 actividades ordinarias ( ||| ordinary activittes ( [|] 1 0.0702576 1 0.705882 2.718 |] ] 1-0 O-f 2-2 [J] 24 
5208 actividades ordinartas ||| ordinary activities ||} 1 6.107143 1 0.75 2.718 [JJ 1-0 G-3 [JJ 133 


5209 actividades ov acontecinentos significativos desde o fim ||| activities or events since the end |[| 1 6.G0278081 1 6.039504 2.748 ||| O-@ B-1 2-2 3-2 4-3 5-4 6-5 [JJ 22a 
5210 actividades ou acontectmentos significativos desde o ||] activities or events stnce the jj] 1 0.0139041 1 0.276520 2,718 [I] 0-0 1-1 2-2 3-2 4-3 5-4 [JP dad 

$243 actividades ou acontecimentos significetivos desde j[| activities or events since ||| 1 0.0905432 1 0.422535 2.718 ||} O-O 1-1 2-2 3-2 4e3 [ff Bad 

$212 actividades ov acontecimentos significativos ||] activittes or events ||| 1 0.0905492 3 6.633903 2,718 ||| O-O 1-1 2-2 3-2 [ff] 248 


5243 actividades ou |}] activities 362173 1 6.633803 2.718 J [| O-@ 2-1 II} $44 
S214 actividades ||| activities of OS71 0.142857 O.0242624 2.728 [|] O-O IJ} 5 71 
52u5 actividades | {| activitles |{] 0.625 9.429571 0.714286 8.75 2.738 If O-8 JIL 8 7 5 
1 
| 


$216 actividades |[{ the activittes ||] 1 6.428571 6.142857 0.223595 2.718 [J] G-1 IN 
$237 activo de catxa ou seu equivalente que []] cash of 8 cash equivalent asset which ||] 10 
5218 activo de caixa ou seu equivalente ||| cash or a cash equivalent osset ||| 0.5 6.00037274 
5219 activo de catxa ou ||| cash or a |]| &.S 8.60216433 1 6.187793 2.718 ||| 2-0 3-1 O-2 TIT 
5220 activo de ||] @ |] 0.410989 6.00320841 1 6.333333 2.718 [I] 0-0 [J] or 42 

S221 activo nas demonstragdes financetras da enpresa que ||| asset in the financial statements of ||| 0.333333 3.05195e-08 1 0.026909 2.718 | f] © 
5222 activo nas demonstragdes financetras de enpresa ||| asset in the financial statements of ||| 0.333333 &.7387Se-07 1 0.020889 2.718 ||] 0-0 t- 
$223 activo nas denonstracgées financetras da []| asset in the financtal statenents of ||| 6.333333 9,00075060N 1 0.020880 2.718 |] O-@ 1-1 1-2 3- 
5224 activo nas demonstragdes financetras ||| asset tn the financlal statements [|] 1 &.00476437 1 0.6520833 2,718 [|] O-O 1-3 1-2 3-3 2-4 [PP 44 
$225 activo mas ||| asset tn the [fj] 1 9.0110601 1 0.0520835 2.718 [| O-0 1-2 1-2 [Jf 221 

5226 activo numa base ststemética durante a [|] asset on a systematic basis ||| 6.333339 4.99335e-08 § O.c0222222 2.728 ||] O- 
5227 acttvo numo base ststematica durante |}] asset on a systematic basts ||] 0.33233) 1.04617¢-06 t O.00222222 2.718 |]| O- 
$228 activo numa base ststematica [|] asset on a systematic basts ||| 0.333331 0.000898692 1 6.00222222 2.718 |] | O-0 I-t 2-2 
5229 activo numa |{| asset on [{] 1 O.00784313 1 0.112111 2.718 |[] O-0 1-1 [Jf rit 

5230 activo HI] @ [1] 0.010989 6.0164167 6.333333 0.333333 2.716 [1] 6-0 [fF] 99 34 

5232 activo [|] asset []] 1 0.660667 6.600067 0.606607 2.718 }{] 0-0 [[] 232 

$232 activos © passtvos a serem altenados ||| assets and Ulabilittes to be disposed of [||| 1 0.00276836 0.5 0,00164253 2.728 |] OO B-3 2+2 343 And SoS Jf] 2 23 


vi 

000226571 1 6.000902693 2.758 [|] 2-0 3-1 0-2 S-3 4-4 $-4 5-5 OO IPP 218 
©.00442611 2,718 [|] 2-0 2-1 O+? S=3 ded SoG SoS [JP 212 
! 


ie | 
it 
32 


Figure 7 — Extract of a phrase table of a 7-gram PT-EN training showing entries from 1 to 7 gram and the 
computed probabilities. 
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Vil. What a Language Model looks like 


The Language Model calculates the probability of a word after a given sentence or the probability of a given sentence 
and is generated during the training process. It tries to guarantee the fluency of the MT output, i.e. that “Ich gehe nach 
Haus’ is translated in the target language by “I am going home” and not by “I am going house” or something like that. 


Depending on the choice made, the Language Model may have entries with up to 9-grams, that is, a sequence of 9 
tokens (words, punctuation marks or symbols separated by a space). In MMM, the default is 7-grams. 


Just to have an idea, the same 3.4 million segment corpus can generate a 7-gram language model (vocabulary) with 
about 155 million entries. 


267 1ARPA 

268 Loadtxt_ram() 

269 1-grams: reading 276799 entries 
270 done levell 

271 2-grams: reading 3385018 entries 
272 done level2 

273 3-grams: reading 12891021 entries 


274 ..done levels 

27S 4-grams: reading 24366544 entries 
276 ....done level4d 

277 S-grams: reading 33019964 entries 
ta ait 6 o's) done levelS 

279 6-grams: reading 38648425 entries 
eo! ne done level6 

281 


282 7-grams: reading 42413877 entries 
Figure 8 — Number of entries in a 7-gram Language Model of the 3.4 million segment corpus PT-EN 
This is what a language model looks like: 


1 -8,005404 
f the -0.905404 


Figure 9 — Entries in a Language Model from 1-gram to 7-gram (here shown extracts with 1, 3 and 7 n-grams) 


Moses supports several different language model toolkits (SRILM, KenLM, IRSTLM, RandLM). In MMM, are available 
the IRSTLM and RANDLM language models. 


KenLM is not available in this version of MMM and SRILM Is not automatically downloaded and installed as it requires 
a licence for non-academic purposes. 


Training a Language Model from large/huge amounts of data can be memory and time expensive. 
The IRSTLM features algorithms and data structures suitable to estimate, store, and access very large LMs. 


RANDLM can be used to train really large LMs. It takes a very different approach to IRSTLM. It represents LMs using a 
randomized data structure. This can result in LMs that are ten times smaller than those created using the IRSTLM, but 
the quality of MT output can be lower, at least in our experience. 


a 
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Vill. Reordering and Recaser Models 


The Reordering Model will be used in the decoding (translation) process to try to get the words in a sentence translated 
in the correct order. For instance, to have the “the black cat” translated as “le chat noir’ in French and not as “le noir 
chat”. 


Before the training, all words in the training corpus are lowercased. The Recaser Model is used to recase words so 
that, for instance, “Union Européenne” is not translated as “european union”. 


IX. Tuning 


This is a process that can be done at the end of the training of a corpus and which aims to improve the quality of MT 
output. 


By translating a small tuning corpus (usually about 1000 to 2000 segments) repeatedly, the system will try to find the 
best weights between the components of a training (Phrase Table, Language Model and the other models) to achieve 
the best quality. 


MMM automatically installs 3 tuners: mert (MMM default), pro and kbmira and you can define the number of runs that 
will be done. 


A ‘run’ is the process by which Moses — the Decoder — translates the tuning corpus and afterwards scores it to see if 
it gets a better result. This operation is repeated as many times as defined in the training script. 


In MMM, the default is 25 runs, i.e., in the tuning stage, the tuning corpus will be translated up to 25 times, depending 
on the tuner used. 


This process takes quite some time and there is no assurance that the quality of MT output will be better than without 
it. 


X. The translation process 


Moses is the Decoder (the “translator”’) which translates new sentences by finding the highest scoring sentence in the 
target language in terms of exactness (according to the Translation Model) and fluency (according to the Language 
Model) from a list of candidate translations (the n-best list). 


Sample N-Best List 


e N-best list from Pharaoh: 


om Ill Reorcgering LY TH WordPenalty |!| Score 
27.0908 -3.63256 -S | 


& small house I!/ 0 -27.0 
this is a little house | 


‘a @ little house i 
thie house ie « esali |! 


talipp Koehn M utorial 4 April 


Figure 10 — Example of an n-best list 


= 
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It's like making a puzzle. Some pieces are easier to fit than others! 


In SMT, sometimes Moses translates segments amazingly well — even difficult ones —... and sometimes its translation 
is plain rubbish! 


It all depends on a large number of factors, an important one being the relevance of the data the system has been 
trained with to the document(s) to be translated. 


A large amount of data is important, but equally important is to train the system with relevant (in-domain) data. 


In MMM, you can select some settings that may, or not, improve the quality of MT output for a particular language pair 
and type of documents to translate. 


XI. Automatic scoring of MT output 


The golden rule for MT output evaluation is human evaluation and there is lots of literature on that on the Internet. 
But there is nothing better than training a corpus and see how useful it is — in real terms — for your translation work! 


However, as for research and production purposes, human evaluation is time-consuming and expensive, various 
metrics (tests) have been developed to automatically score MT output. 


The BLEU score is one of the most widely used and it indicates how closely the word sequences in one set of data 
correlate with (match) the word sequences in another set of data, such as a reference human translation. The higher 
the score, the better. 


This means that if MT output is identical or near the human translation, the score will reflect it. However, it doesn’t 
always work the other way round as there are generally many different ways of translating a sentence and, just 
because the MT translation is (very) different from the reference human translation, it doesn’t necessarily mean that it 
is worse or incorrect! 


AUTOMATIC EVALUATION 
BLEU SCORE — HOW IT WORKS ~ = 


i120 


Automatic Evaluation 


Reference Translation 
— the gunman was shot to death by the police - 


System Translations 

— the gunman was police kill . 
wounded police jaya of 
the gunman was shot dead by the police 
the gunman arrested by police kill . 
the gunmen were killed . 
the gunman was shot to death by the police 
gunmen were killed by police 7SUB>0O 7SUB>0 
al by the police 
the ringer is killed by the police 
police killed the gunman . 


Matches 
— green = 4 gram match (good!) 


— red = word not matched (bad!) 


ilipp Koehn utoria 


Figure 11 — The BLEU score — How it works 


Py 
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In MMM, the results are presented on a scale of 0 to 1 as generated by the scorer, but it is easier to think of it in a 
scale of 1 to 100. Therefore, a 0.5685 BLEU score may be read as a kind of percentage: 56.85. 


In MMM, it is also available the NIST score. 


To have an idea of the quality of MT output — as measured by the BLEU metrics, see the paper 462 Machine 
Translation Systems for Europe, by Philipp Koehn, Alexandra Birch and Ralf Steinberger (2009)2. 


SMT has evolved since then, but this gives an idea of how different the MT quality levels can be depending — among 
other factors — on the language combination. 


Target Language 
eo wy @&2#a ade asst nrt'-=_maeet tft w&F ma PrteawFs* at ws 


613 - 38.7 30.4 30.6 34.5 46.9 25.5 20.7 42.4 22.0 43.5 20.3 20.1 25.0 44,0 35.1 45.0 36.8 34.1 34.1 30.9 
53.6 26.3 - 35.4 43.1 32.8 47.1 26.7 20.5 WA 27.6 42.7 27.6 30.3 19.6 50.2 W.2 44.1 W.7 20.4 314 41.2 
56.4 32.0 42.6 - 43.0 34.0 46.9 30.7 30.5 41.0 27.4 44.3 34,5 35.6 26.3 40.5 39.2 45.7 30.5 43.6 41,3 42.9 
57.0 28.7 44.1 35,7 = 34,3 47.5 27,8 31.6 41.3 24,2 43.8 20.7 32.0 21.1 48.5 34.3 45.4 33.9 33.0 36.2 47.2 
80.8 32.4 43.1 37,7 44.5 - 58.0 26.5 20.0 48.3 23.7 49.6 29.0 32.6 23.8 486.0 34.2 52.5 37.2 33,1 36,3 43,3 
06.0 31.1 42.7 37.5 444 30.4 - 25.4 26.5 51.9 24.0 51.7 20.6 30.5 24.0 46.6 33.9 57.5 38.1 31.7 33.9 43.7 
d . / 37.7 33.4 30.9 37.0 35.0 36.0 20.5 41.3 32.0 37.8 28.0 30.6 32.9 37.3 
49.3 23.2 36,0 32.0 37.9 27.2 39.7 34.9 - 20.5 27.2 36.6 30.5 32.5 19.4 40,6 26.8 37.5 26.5 27.3 26.2 37.6 
04.0 34.5 45.1 39.5 47.4 42.8 00.9 26.7 30.0 - 25.5 50.1 26.3 31.9 25.3 51.0 35.7 O1.0 43.6 33.1 35.0 45.8 
48.0 24.7 34,3 30.0 33.0 25.5 34.1 29.6 20.4 W.7 - 33.5 20.6 31.9 18.1 36.1 20.6 34.2 25.7 25.6 26.2 30.5 
O10 32.1 44.3 38,9 45.86 40.6 26.9 25.0 20.7 S27) 24,2 - 29.4 32.6 24.6 50.5 35.2 56.5 30.3 325 34,7 443 
51.8 27.6 33.9 37.0 36.8 26.5 21.1 34.2 32.0 34.4 265 36.8 - 40.1 22.2 38.2 31.6 31.6 20.5 31.8 35,3 35,3 
54.0 20.1 35.0 37.6 36.5 29.7 6.0 34.2 32.4 35.6 20.3 36.9 36.4 - 23.3 41.5 344 3.6 31.0 33.3 37.1 36.0 
PRA 32.2 37.2 S79 W.9 337 48.7 26.9 25.8 42.4 22.4 43.7 30.2 33.2 - 44.0 37.1 45.9 38.9 35.8 40.0 41.6 
96.0 20.3 46.9 37,0 45,4 35.3 49,7 27.5 20.6 43.4 25.3 44.5 28.6 31.7 22.0 - 32.0 47,7 33.0 30,1 34,6 43,6 
ple 00.0 31.5 40.2 44.2 42.1 34.2 46.2 29.2 29.0 40.0 24.5 43.2 33.2 35.6 27.9 448 - 44.1 36.2 36.2 39.6 42.1 
pt OO.7 31.4 42.0 364 42.8 40.2 00.7 26.4 20.2 53.2 23.8 $2.6 28.0 31.5 24.8 40.3 34.5 - 39.4 32.1 344 43.9 
ro «660.8 33.1 38.5 37.6 40,3 35.6 50.4 24.0 26.2 46.5 25.0 44.8 26.4 20.9 26.7 43.0 35.8 46,5 - 31,5 35.1 304 
Sk - 
si 
w 


aR TAAFFHRSLSE LS ES 
2 
ec 
4 
J 
£ 
o 
é 
1 


00.8 32.6 30,4 48,1 41.0 33,3 46.2 29.8 28.4 30.4 27.4 41,8 33,8 36.7 28.5 44.4 39.0 43,3 353 42.6 41.6 
61.0 33,1 37.9 43.5 42.6 34.0 47.0 31.1 28.8 38.2 25.7 42.3 34.6 37.3 30.0 45.9 386.2 44,1 35.6 36.9 - 42.7 


hilipp Koehn, L 


dinburgh oN 23 March 2010 


Figure 12 — Translating between EU official languages 
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